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Chimeric D NA-bindincr/DNA Methvltransf erase Nucleic Acid and 
Polypeptide and Uses Thereof 

5 This application claims the benefit of U.S. Provisional 
Application No. 60/004,445, filed September 28, 1995, and 
this application is a continuation-in-part of U.S. Serial 
No. 08/594,866, filed January 31, 1996, the contents of 
which are hereby incorporated by reference into the present 
10 application. 

Background of the Invention 

Throughout this application, various publications are 
15 referenced by author and date. Full citations for these 
publications may be found listed alphabetically at the end 
of the specification immediately preceding the claims. The 
disclosures of these publications in their entireties are 
hereby incorporated by reference into this application in 
20 order to more fully describe the state of the art as known 
to those skilled therein as of the date of the invention 
described and claimed herein. 

All mammalian promoters tested to date have been found to be 
25 silenced when they contain 5-methylcytosine (m 5 C) at CpG 
sites (reviewed by Bestor, 1990; Meehan et al., 1993). 
Methylation represses transcription directly by interference 
with binding of transcription factors (Joel et al., 1993, 
and references therein) and methylated sequences are 
30 assembled into condensed chromatin that is inaccessible to 
transcription factors (reviewed by Bird, 1992) . It has been 
shown that methylation of CpG dinucleotides in the 5' long 
terminal repeat (LTR) of HIV-1 suppresses viral 
transcription (reviewed by Bednarik, 1992) . Methylation can 
35 reduce transcription from the HIV-1 5' LTR to undetectable 
levels (Joel et al., 1993). It is also known that 
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methylation patterns are transmitted by clonal inheritance 
(Wigler, 1981), and that this is due to the strong 
preference of mammalian DNA methyltransf erase for 
hemimethylated DNA (Bestor and Ingram, 1983; Bestor, 1992). 
5 The heritability of methylation patterns causes the affected 
promoters to be irreversibly inactivated, as has been 
observed for a large number of genes in cultured cells 
(reviewed by Holliday, 1993) and in human tissues (Yoshiura 
et al., 1995; Herman et al . , 1994). While the exact 
10 function of DNA methylation remains the subject of 
discussion, its ability to suppress transcription is not in 
doubt . 

Most exogenous nucleic acid sequences that a resident in a 

15 cell such as transposable elements and proviral DNA are 
methylated and inactivated in the genomes of mammals, 
flowering plants, and those fungi whose genomes contain m 5 C. 
Retroviral DNA is especially prone to inactivation by de 
novo methylation upon germline transmission, and this has 

20 greatly reduced the usefulness of retroviral transducing 
vectors in the construction of transgenic animals (Jahner 
and Jaenisch, 1984; Jaenisch et al., 1985). Treatment of 
cultured cells or mice with the demethylating drug 5- 
azacytidine (an irreversible inhibitor of DNA 

25 methyltransf erases) can reactivate methylated retrovirus 
genomes (Jaenisch et al., 1985), and endogenous genes that 
have been silenced by ectopic de novo methylation of 
promoter regions can also be reactivated by demethylation 
(Holliday, 1993; Yoshiura et al., 1995). It is notable that 

30 organisms whose DNA lacks m 5 C (such as Drosophila) suffer 
far larger numbers of insertion mutations (Ashburner, 1992) , 
which may reflect a reduced capacity for the control of 
mobile elements.. These observations, together with 
evolutionary considerations (Bestor, 1990), strongly suggest 

35 that cytosine methylation is part of a genomic host defense 
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system that limits the proliferation of parasitic sequence 
elements (Bestor, 1990; Bestor and Coxon, 1993) . The 
selective advantage of such a defensive system is obvious, 
given that a significant fraction of the genome represents 
5 exogenous sequences that are invisible to the immune system 
and which might impose a lethal mutagenic or cytotoxic 
burden if allowed to proliferate unchecked. 

The proviral DNA of many retroviruses is propagated in the 
10 repressed, latent state as a result of methylation of LTR 
sequences (reviewed by Jahner and Jaenisch, 1984) . A body 
of evidence indicates methylation can cause latency in cells 
infected with HTLV-1 (Cassens and Ullrich, 1993) or HIV-1 
(Bednarik et al., 1990; Joel et al . , 1993 reviewed by 
15 Bednarik, 1992) . The importance of methylation in latent 
HIV-1 infections in patients is unknown, but as HIV has only 
recently entered human populations, the host -parasite 
relationship may be far from equilibrium (Baltimore, 1995) 
and HIV proviral DNA may be inactivated with low efficiency 
20 under normal conditions (McCune, 1995) . It is clear, 
however, that HIV-1 transcription is very sensitive to 
methylation of 5' LTR sequences (reviewed by Bednarik, 
1992) . 

25 
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Summary of the Invention 

The present invention provides a chimeric protein which 
5 comprises a mutated DNA methyltransf erase portion and a DNA 
binding protein portion that binds sufficiently close to a 
promoter sequence of a target gene, which promoter sequence 
contains a methylation site, to specifically methylate the 
site and inhibit activity of the promoter and thus inhibit 

10 expression of the target gene. This invention also provides 
for a method for inhibiting the expression of a target gene 
which includes contacting a promoter of the target gene with 
the chimeric protein, so as to specifically methylate the 
promoter sequence of the target gene thus inhibiting 

15 expression of the target gene. 



WO 97/11972 



PCT/US96/15576 



-5- 

Brief Description of the Fimir flB 

Figure 1. Distribution of CpG sites and transcription 
factor binding sites in the 5' LTR of HIV-1 (Sequence I.D. 

5 No. 1). CpG sites that fall within the recognition sequence 
of a restriction methyltransf erase or methylation-sensitive 
endonuclease are shown above downward-pointing arrows. Data 
are from Garcia and Gaynor, 1994. 

10 Figure 2. Sequence of HIV-1 5' LTR. CpG dinucleotides are 
bold and underlined; CpG sites that lie within the 
recognition sequence of methylation-sensitive restriction 
endonuclease are bold and italicized. Major transcription 
start site is denoted by +1, and the underlined sequence 

15 following denotes the transcribed sequence; numbering is 
with respect to the transcription start site. Please refer 
to Figure 1 for location of protein binding sites. Strike 
through indicates the 9 bp binding site for the 3 finger 
protein developed by Wu et al. (1995) . Sequence and 

20 numbering from Garcia and Gaynor, 1994, 

Figure 3. Gene silencing via targeted DNA methylation. A 

chimera between a sequence-specific DNA binding protein and 
a DNA methyltransferase with attenuated DNA binding directs 
25 methylation only to sites in the vicinity of the recognition 
sequence of the DNA binding protein. 

Figure 4 . Domain organization of mammalian DNA methyl- 
transferase. This enzyme is comprised of a long N- terminal 
30 regulatory domain and a C-terminal catalytic domain that 
closely resembles bacterial restriction methyltransferase. 
See Bestor and Verdine (1994) for a recent review. 



Figure 5 . Protein-DNA contacts in the M. Haelll-DNA 
3 5 transition state intermediate. The target cytosine is 
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everted from the helix during methyl transfer, and is shown 
in the extra helical position. Thin dotted lines indicate 
base-specific contacts in the major groove, bold lines 
indicate contacts with the sugar -phosphate backbone. 
5 Mutation of residues involved in the sugar-phosphate 
contacts will reduce the affinity of DNA methyl -transferases 
for DNA, as will framework mutations that alter the 
disposition of the contact residues. Such mutations will 
make methyl transfer dependent on the DNA binding protein 
10 moiety of a DNA binding protein/DNA methyltransf erase 
chimera, as described herein. Data from Reinisch et al . 
(1995) . 

Figure 6 . pLS, an expression construct to be used in the 
15 selection of DNA binding protein/DNA methyltransf erase 
chimeras that methylate predetermined sequences. Lex A was 

chosen for its ability to direct fused proteins to Lex A 
binding sites on DNA (Brent and Ptashne, 1986), and M.SssI 
was chosen because of its high specific activity and the 
20 fact that it is the only bacterial methyltransf erase that 
has the same specificity (5'-CpG-3') (Renbaum and Razin, 
1992) as the mammalian enzyme (Bestor and Ingram, - 1983 ) / 

Figure 7. Cyclic in vitro/in vivo selection for DNA binding 
2 5 protein/DNA methyltransf erase chimeras in which methylation 
is dependent on the DNA binding protein moiety. McrBC en- 
codes a nuclease that degrades heavily methylated DNA. 
Unmethylated DNA that has been linearized by a methylation- 
sensitive endonuclease does not transform. Use of these 
30 selective procedures on combinatorial libraries allows 
selection of chimeric DNA binding protein-DNA 
methyltransferase that target methylation only to a unique 
target site. This selection scheme may be applied to 
LexA/DNA methyltransferase chimeras (Example 1) and to zinc 
35 finger/DNA methyltransferase chimeras (Examples 3-5) . 
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Figure 8. Identification of CpG sites in the HIV-1 5' LTR 
that yield maximal suppression of transcription when methy- 
lated. Individual 20mer primers that contain single m 5 CpG 
5 sites are hybridized to single-stranded M13 clones that 
contain the HIV-1 LTR and the bacterial CAT reporter gene. 
Extension of the primers with a DNA polymerase (sequenase) 
yields double -stranded DNA that is methylated on one strand 
at one CpG site. Upon transfection the hemimethylated site 
10 rapidly becomes methylated (Busslinger et al., 1983). The 
positions of each of the 11 CpG sites in the HIV-1 5' LTR 
are given with respect to the transcription start site in 
the HIV-1 genome (Garcia and Gaynor, 1993) . 

15 Figure 9 . Phage-display selection of a zinc finger 

proteins that bind to predetermined sequences in the HIV-1 
5' LTR. The strategy used to construct tridactyl zinc 
finger proteins that bind to specific sequences was 
independently developed by the laboratories of Berg 

20 (Desjarlais and Berg, 1993), Klug (Choo and Klug, 1994), 
Pabo (Rebar and Pabo, 1994), and Barbas {Wu et al . , 1994). 
The schemes are very similar; the one depicted here is that 
of Choo and Klug (1994) . The method has been used to obtain 
proteins that bind to a number of predetermined sequences 

25 with high specificity and affinity. The boxed sequence 
shown in the figure appears 20 base pairs 5' of the first 
Hpall (CCGG) site in the LTR sequence of Figure 2; it is 
shown for illustrative purposes only (Sequence I.D. No. 2). 
The actual target sequence will be determined as described 

30 in Example 2. 

Figure 10. Organization and transcription of the HBV genome. 

The viral genome of 3.2 kb is represented by a horizontal 
line. Positions of methylatable CpG sites are shown by 
3 5 vertical lines. Arrows indicate major viral transcripts. 
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Notice that transcription of each major mRNA initiates in a 
cluster of CpG sites. CpG methylation has been shown to 
suppress viral transcription in transgenic animals (Miller 
and Robinson, 1993; Pourcel et al., 1990). 

5 

Figures 11A - 11B. Targeted methylation of a preselected CpG 
site. 

(A) A diagram of pLS which encodes a lexA-M.SssI fusion 
protein. A Smal site is located adjacent ot a lexA binding 

10 site in the same plasmid. LexA is predicted to position 
M.SssI over this site, and preferential methylation of 
theSmal site was expected (M.SssI has no intrinsic 
preference for Smal sites) . The positions of other 
restriction sites are shown approximately to scale; other 

15 features of the plasmid are omitted for the sake of clarity. 

(B) Data confirms targeted methylation. Odd-numbered lanes 
contain plasmid that encodes enzymatically active fusion 
protein. The unmethylated state cuses the DNA to be cleaved 
to the linear form. Notice that the Smal site is largely 

2 0 resistant to cleavage in lane 8 (compare to lanes 7 and 10) , 
while the other sites remain sensitive. The methylation- 
insensitive endonulease Ncol was used as a control. Heavy 
methylation is observed only at the Smal site; little or no 
detectable methylation is seen at Sail and Xhol sites (lanes 

25 4 and 6) . Lin: linear unit-length DNA (unmethylated) . CCC, 
covalently closed circular DNA (methylated in lane 88) . 
Lane 11 contains length markers. The diffuse band that 
migrates above the largest length marker is bacterial 
chromosomal DNA. 

30 

Figure 12. Construction of a plasmid that performs targeted 
methylation. pLM9 was constructed by cloning an Lac 
repressor-M.SssI fusion gene between the Pvull sites of 
pBluescript. Adjacent to the fusion gene is a synthetic 
35 methylation target sequence, LacO-CpGs, which contains the 



WO 97/11972 



PCT/US96/15576 



-9- 

Lac operator (to which Lac repressor binds) and a number of 
CpG target sites. These are 12 amino acids in the linker 
between Lad and M.SssI. Expression of the fusion is 
controlled by the native promoter of lad gene. 

5 

Figure 13. Expression of LacI-M.SssI fusion in E. coli XL1- 
Blue [pLM9] . The targeted methyltransf erase is marked with 
an arrow. The antibody used was anti-Lad. An intact 
fusion protein was produced in good yield. Lane marked pBS 
10 contained proteins extracted from cells that contained only 
the expression vector. 

Figure 14. Targeted methylation demonstrated by bisulfite 
sequencing analysis. The methylation target shown in Figure 

15 12 was isolated from cells expressing Lad -M.SssI fusion 
proteins. The bisulfite sequencing method was used to 
identify methylated sites. pLM9-l and -2 are two sister 
bisulfite cones carrying BS-modified methylation target 
sequence derived from a pLM9 mutant that has attenuated 

20 activity. Methylation is limited to CpG sites in the 
immediate vicinity of the binding site of the sequence 
specific DNA binding protein Lad. pBS is the precursor of 
pLM9 and has no lad/M.SssI gene; no methylated sites are 
present. pM is a pBS derivative that encodes fully active 

25 (non- targeted) M.SssI gene; all sites are methylated. These 
data confirm that targeted methylation has been used to 
direct methylation to CpG sites in the vicinity of the 
binding site of a sequence-specific DNA binding protein. 

3 0 Figure 15. Zinc- finger (Zif) targeted methyltransf erases . 

The above Zif fusion constructs have been made and 
expression of the appropriate fusion proteins has been 
confirmed by immunoblot and methylation analysis. The 
coding regions have also been transferred into the mammalian 
35 expression vector pcDNA3 . 1/His/A. 
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Figure 16. HIV-1 LTR constructs used to analyze targeted 
methylation. Plasmid pLTR-CAT (top) contains the U3 region 
and 80% of the R region of the HIV-1 proviral long terminal 
repeat (LTR) driving expression of the chloramphenicol 
5 acetyltransf erase (CAT) reporter gene. These elements 
contain the HIV-1 core promoter, enhancer, negative 
regulatory element, and trans-activated region (TAR). These 
elements also contains 10 CpG dinucleotides (shown as 
circles above the LTR), which are targets for methylation. 

10 Plasmid pLG-luc (bottom) contains the entire proviral LTR 
and gag leader sequence (GLS) driving expression of the 
firefly lucif erase reporter gene. The 150 bp GLS, which 
lies between the LTR and the first viral gene, contains an 
additional 14 CpG dinucleotides and has been shown to be 

15 important for viral expression. The circles are shaded (see 
box) to indicate the conservation of the individual CpG 
sites in different HIV-1 sequences of class B, the class 
affecting the vast majority of infected individuals in the 
United States and Western Europe. Transcription factor 

20 binding sites are labeled below the LTR and hatched when 
they contain a CpG site. The LTR is flanked by a short 
region of human genomic DNA present in the original HIV 
clone. 

25 Figure 17. Complete suppression of HIV-1 transcription by 
LTR methylation. Methods: pLTR-CAT was treated with M.SssI 
to methylate all CpG sites. Methylated or unmethylated 
pLTR-CAT was cotransf ected with unmethylated pLG-luc into 
HLtat cells and lysates were prepared at the times 

3 0 indicated. To control for the efficiency of transfection 
and recovery, volumes of lysate containing equivalent 
luciferase activity were assayed for CAT activity using the 
FASTCAT assay kit. Reaction products were quantitated using 
NIH Image. "CAT Activity" indicates conversion of 1- 

3 5 deoxychloramphenicol to 3 - acetyl - 1 -deoxychloramphenicol , 
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normalized to arbitrary units. Control cells were 
transfected with pLG-luc only and harvested 48 hours post- 
transf ection. 
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Detailed Description of the Invention 

The present invention provides for a chimeric protein which 
comprises a mutated DNA methyl transferase portion and a DNA 
5 binding protein portion that binds sufficiently close to a 
promoter sequence of a target gene, which promoter sequence 
contains a methylation site, to specifically methylate the 
site and inhibit activity of the promoter and thus inhibit 
expression of the target gene. 

10 

As used herein, "chimeric protein" is a protein including at 
least two portions of two proteins positioned adjacent to 
each other so that the coding sequences of both portions are 
in frame and can be translated into one polypeptide. The 
15 chimeric protein may include a portion of a DNA binding 
protein and a portion of a mutated or wild type DNA 
methyl transferase protein. The two portions of the chimeric 
protein may be linked with intermediate sequences of 8,12,16 
or 20 codons expressed as part of the chimeric protein. 

20 

As used herein, "adjacent" includes two sequences that are 
positioned directly next to one another so that the codons 
of each sequence are in frame and contiguous. "Adjacent" 
also includes two sequences that are nearby one another 
25 linked to intermediate sequences and are in frame and can be 
translated into one polypeptide. 

As used herein "linked to" encompasses a means of joining 
two separate portions of a protein, each of which may have 

3 0 a separate function. The means of joining the functional 
portions of the protein may include a covalent or 
noncovalent association between functional polypeptides via 
an organic substance, an inorganic substance, a nucleotide, 
a polynucleotide, a peptide nucleic acid, a peptide, a 

35 triplex peptide, an electrostatic means, a triplex nucleic 
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acid, or a self -assembling biomaterial. 

As used herein, "DNA methyltransf erase" is a protein which 
is capable of methylating a particular DNA sequence, which 
5 particular DNA sequence may be -CpG- . This protein may be 
a mutated DNA methyltransf erase, a wild type DNA 
methyltransf erase, a naturally occurring DNA 
methyltransf erase, a variant of a naturally occurring DNA 
methyltransf erase, a truncated DNA methyltransf erase, or a 

10 segment of a DNA methyltransf erase which is capable of 
methylating DNA. The DNA methyltransf erase may include 
mammalian DNA methyltransf erase, bacterial DNA 
methyltransf erase, M.SssI DNA methyltransf erase and other 
proteins or polypeptides that have the capability of 

15 methylating DNA. 

As used herein, "mutated DNA methyltransf erase protein 
portion" is a segment of a DNA methyltransf erase protein or 
polypeptide containing an amino acid sequence which is 

20 different from the wild type DNA methyltransf erase amino 
acid sequence. A mutated DNA methyltransf erase protein 
portion may include a segment of a naturally occurring 
variant of the wild type DNA methyltransf erase protein, a 
segment of a DNA methyltransf erase with a sequence altered 

25 from the wild type sequence, a truncated protein or portion 
of protein or polypeptide that is capable of methylating 
DNA. This mutated DNA methyltransf erase may be altered at 
one or more of its amino acids and retains part of the wild 
type methyltransferase activity, although the activity of 

30 the mutated DNA methyltransferase protein portion may be 
attenuated activity. 

As used herein, "DNA binding protein portion" is a segment 
of a DNA binding protein or polypeptide capable of 
35 specifically binding to a particular DNA sequence. The 
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binding is specific to a particular DNA sequence site. The 
DNA binding protein portion may include a truncated segment 
of a DNA binding protein or a fragment of a DNA binding 
protein. 

5 

As used herein, "binds sufficiently close" means the 
contacting of a DNA molecule by a protein at a position on 
the DNA molecule near enough to a predetermined methylation 
site on the DNA molecule to allow proper functioning of the 
10 protein and allow specific methylation of the predetermined 
methylation site. 

As used herein, M a promoter sequence of a target gene" is at 
least a portion of a non-coding DNA sequence which directs 
15 the expression of the target gene. The portion of the non- 
coding DNA sequence may be in the 5' -prime direction or in 
the 3 '-prime direction from the coding region of the target 
gene. The portion of the non-coding DNA sequence may be 
located in an intron of the target gene. 

20 

The promoter sequence of the target gene may be a 5 ' long 
terminal repeat sequence of a human immunodeficiency virus -1 
proviral DNA. The target gene may be a retroviral gene, an 
adenoviral gene, a foamy viral gene, a parvo viral gene, a 
25 foreign gene expressed in a cell, an overexpressed gene, or 
a misexpressed gene. 

As used herein "specifically methylate" means to bond a 
methyl group to a methylation site in a DNA sequence, which 
30 methylation site may be -CpG-, wherein the methylation is 
restricted to particular methylation site(s) and the 
methylation is not random. 

In one embodiment of this invention, the chimeric protein 
35 may be a zinc three -finger DNA binding polypeptide linked to 
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a CpG-specific DNA methyltransf erase protein portion. The 
chimeric protein may comprise a mutated Lex A DNA binding 
protein or polypeptide linked to a cytosine 
methyltransf erase protein portion or polypeptide. 

The chimeric protein may include at least a portion of a 
mutated M.SssI DNA methyltransf erase protein or at least a 
portion of a mutated mammalian DNA methyltransf erase . 

In one embodiment of this invention, an expression vector 
may code for the chimeric protein described hereinabove. 
The expression vector may be a replicable vector and may 
include a pLS vector, a prokaryotic expression vector, a 
yeast expression vector, a baculovirus expression vector, a 
mammalian expression vector, or an episomal mammalian 
expression vector. 



As used herein, "expression vector" is a nucleic acid 
molecule which is capable of having a coding sequence of DNA 
20 or RNA ligated into it and then expressing that sequence 
when introduced into an appropriate host cell . An 
expression vector may be a mammalian expression vector or a 
prokaryotic expression vector. 

25 Another embodiment of this invention is a method for 
inhibiting the expression of a target gene which includes 
contacting a promoter sequence of the target gene with the 
chimeric protein described hereinabove, so as to 
specifically methylate the promoter sequence of the target 

30 gene thus inhibiting expression of the target gene. In this 
embodiment, the target gene may be an endogenous target gene 
which is native to a cell or a foreign target gene. The 
foreign gene may be a retroviral target gene or a viral 
target gene. 



35 
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The target gene in this embodiment may be associated with a 
cancer, a central nervous system disorder, a blood disorder, 
a metabolic disorder, a cardiovascular disorder, an 
autoimmune disorder, or an inflammatory disorder. The 
5 cancer may be acute lymphocytic leukemia, acute myelogenous 
leukemia, B-cell lymphoma, lung cancer, breast cancer, 
ovarian cancer, prostate cancer, lymphoma, Hodgkin's 
disease, malignant melanoma, neuroblastoma, renal cell 
carcinoma or squamous cell carcinoma. The central nervous 

10 system disorder may be Alzheimer's disease, Down's syndrome, 
Parkinson's disease, Huntington's disease, schizophrenia, or 
multiple sclerosis. The infectious disease may be 
cytomegalovirus, herpes simplex virus, human 
immunodeficiency virus, AIDS, papillomavirus, influenza, 

15 Candida albicans, mycobacteria, septic shock, or associated 
with a gram negative bacteria. The blood disorder may be 
anemia, hemoglobinopathies, sickle cell anemia, or 
hemophilia. The cardiovascular disorder may be familial 
hypercholesterolemia, atherosclerosis, or renin/angiotensin 

20 control disorder. 

The metabolic disorder may be ADA, deficient SCID, diabetes, 
cystic fibrosis, Gaucher' s disease, galactosemia, growth 
hormone deficiency, inherited emphysema, Lesch-Nyhan 
2 5 disease, liver failure, muscular dystrophy, phenylketonuria, 
or Tay-Sachs disease. The autoimmune disorder may be 
arthritis, psoriasis, HIV, or atopic dermatitis. The 
inflammatory disorder may be acute pancreatitis, irritable 
bowel syndrome, Chrone's disease or an allergic disorder. 

30 

In one embodiment of the subject invention promoter 
sequences of the target genes listed below may be 
therapeutic target genes of the subject invention: dominant 
oncogenes, c-MYB, c-MYC, c-SIS, ERB A, ERB B, ERB B-l, ERB 
35 B-2, HER-2/NEU, c-SRC, c-YES, c-FPS, c-FES, c-FOS, c-JUN, c- 
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ROS, c-ABL, c-FGR, c-MOS, c-RAS, c-RAF, c-MET , c-ETS, BCL-1, 
BCL-2, ETS, c-FMS, c-FES, c-BLK, TCL-1, TCL-2, TCL-3, TCL-5, 
ALL-1/HRX/MLL, PML promoter in PML/RAR-a fusion, RAR-a 
promoter in RAR-a/PML fusion, and NF-1. 

5 

Genes that are overexpressed in cancer cells are also target 
genes of the subject invention. Inhibiting the expression 
of these target genes may reduce tumorigenesis and/or 
metastasis and invasion. Cancer related genes include: 
10 collagenase 92 Kd Type 4, collagenase 72 Kd Type 4, 
osteopontin, calcyclin, fibroblast growth factor, epidermal 
growth factor, matrilysin and stromolysin. 

Viruses that establish chronic infections and which are 
15 involved in cancer or chronic diseases are also target genes 
of the subject invention. Virus that have possible target 
genes include hepatitis C, hepatitis B, varicella, herpes 
simplex types I and II, Epstein-Barr virus, cytomegalovirus, 
JC virus and BK virus. 

20 

In a preferred embodiment of the subject invention, the 
promoter sequence of viral protein X and the promoter 
sequence of pre-S2/5 of the hepatitis B virus may be target 
genes . 

25 

In another embodiment of the subject invention, genes whose 
expression is involved in neurodegenerative disease may be 
target genes of the subject invention. These target genes 
may include beta amyloid precursor protein and the prion 

30 protein both of which are shown to be involved in 
Alzheimer's disease. In another embodiment of the subject 
invention, genes which are involved in the etiology of 
acromegaly may be target genes of the subject invention. 
These target genes may include somatostatin, growth- hormone 

35 releasing hormone and constitutive G-stimulatory protein 
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mutants. 

The target gene may be in a cell. The cell may be a 
bacterial cell, an animal cell or a plant cell. The cell 
5 may be a eukaryotic cell, a prokaryotic cell, a virus 
packaging cell, a somatic cell, a germ cell, a neuronal 
cell, a myocyte, a T lymphocyte, a CD4* cell, a tumor cell, 
a CD4+ cell, or a stem cell. 

10 The chimeric protein may be a mutated Lex A DNA binding 
protein portion or polypeptide linked to a cytosine DNA 
methyltransf erase protein portion or polypeptide. 

The DNA methyltransf erase protein portion may be M.SssI DNA 
15 methyltransferase protein or the mammalian DNA 
methyltransf erase protein. The chimeric protein may include 
a tridactyl zinc finger protein or polypeptide capable of 
specifically binding the HIV-1 5'LTR nucleic acid sequence 
linked to a cytosine methyltransferase polypeptide. The 
20 cytosine DNA methyltransferase polypeptide may be a M. SssI 
DNA methyltransferase protein or polypeptide or at least a 
portion of the mammalian DNA methyltransferase protein. 

The contacting may be by means of liposome mediated 
25 delivery, retroviral delivery, gene bombardment, 
electroporation, electronic pulse delivery, air-gun 
injection of a nucleic acid-coated pellet, a self -assembling 
nanocrystalline composition, a cytofectin analog, 
condensation of the nucleic acid, receptor mediated gene 
30 transfer, glycosylated macromolecular carriers, polar 
(glyco) lipids, (glyco) peptides, synthetic polymers, a 
triplex nucleic acid, naked nucleic acid transfer, particle- 
mediated nucleic acid transfer or cationic precipitation. 

35 A further embodiment of this invention is a method for 
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inhibiting expression of a target gene in a multicellular 
organism which includes contacting a promoter sequence of 
the target gene with the chimeric protein described 
hereinabove or a nucleic acid molecule encoding the chimeric 
5 protein, so as to specifically methylate the promoter 
sequence of the target gene and thus inhibit expression of 
the target gene in the multicellular organism. 

The multicellular organism may be a plant, an animal or a 
10 human. The plant may be an alfalfa plant, a broccoli plant, 
a rapeseed plant, a carrot plant, a chicory plant, a coffee 
plant, a cucurbita plant, a euromelon plant, a potato plant, 
a raspberry plant, a sunflower plant, a tomato plant, or a 
wheat plant . 

15 

The administration of the chimeric protein to the 
multicellular organism may comprise intralesional , 
intraperitoneal, intramuscular or intravenous injection; 
liposome -mediated delivery; viral infection; gene 

20 bombardment; topical, nasal, oral, anal, ocular or otic 
delivery. The viral infection may be via a non- integrating, 
replication-defective virus. The virus may comprise a 
replication-defective HIV-l provirus, a retroviral vector, 
an adeno-associated virus, a N2 retroviral vector, a SIM 

25 retroviral vector, a LNL6 vector, a LXSN vector or a MMuLV 
retroviral vector. 

The process by which a plant or animal is rendered resistant 
to viral infection comprises introducing into the plant or 

30 animal a construct which on translation gives rise to the 
above-mentioned chimeric protein. The introduction of the 
nucleic acid molecule is accomplished by genetic 
transformation of a part of the plant by a DNA sequence 
coding for the nucleic acid molecule, followed by the 

35 regeneration of a transgenic plant. The transformation is 
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carried out by the intermediary of Aorrobacterium tum fi f a ripn g 
or Aqrotp^cterium rhizoqenes. (U.S. Patent Nos. 5,107,065 and 
5,188,958) . 

5 The present invention is further directed to a DNA construct 
for a plant, the construct comprising a genetic sequence and 
a promoter capable of directing expression of the genetic 
sequences wherein the genetic sequence on expression 
provides at least a portion of a sequence specific DNA 

10 binding protein linked to at least a portion of a mutated 
DNA methyl transferase protein. The DNA construct may 
further be part of a DNA transfer vector suitable for 
transferring the DNA construct into a plant cell and 
insertion into a plant genome. In an embodiment of the 

15 present invention, the DNA construct is carried by broad 
host range plasmid pGA470 which is capable of transformation 
into plant cells using Aarobacterium . The present 
invention, however, extends to other means of transfer such 
as genetic bullets (e.g. DNA-coated tungsten particles, 

20 high- velocity micro projectile bombardment) and 
electroporation amongst others (Maliga, 1993; Bryant, 1992; 
or Shimamoto, 1989) . 

The gene encoding the chimeric protein, having been 
25 introduced into the nonhuman mammal, or an ancestor of the 
nonhuman mammal at the single cell stage or an embryonic 
stage, is linked to a promoter and integrated into the 
genome of the nonhuman mammal. One skilled in the art would 
be familiar with the experimental methods necessary to 
3 0 produce a transgenic mammal, as described in Leder et al., 
U. S. Patent No. 4,736,866 and Krimpenfort and Berns, U. S. 
Patent No. 5,175,3 84 and Wagner and Chen, U. S. Patent No. 
5,175,385. 

35 A further embodiment of the subject invention is a method of 
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treating a subject infected with a virus which comprises 
administering to the subject a therapeutic composition 
comprising a chimeric protein, or a nucleic acid molecule 
encoding the chimeric protein, capable of sequence specific 
5 methylation of a site(s) in a promoter sequence of a target 
viral gene under suitable conditions so as to specifically 
methylate the promoter sequence of the target gene thereby 
inhibiting expression of the target gene and thus treating 
the subject infected with the virus. Examples of viruses 
10 are listed herein. 

The therapeutic composition may comprise a nucleic acid 
encoding at least a portion of a mutated Lex A DNA binding 
protein or polypeptide linked to at least a portion of a 

15 mutated DNA methyltransf erase protein or polypeptide. The 
therapeutic composition may be a nucleic acid encoding a 
tridactyl zinc finger DNA binding protein or polypeptide 
capable of specifically binding the HIV-1 5' LTR nucleic 
acid sequence linked to at least a portion of a mutated DNA 

20 methyltransf erase protein or polypeptide. The therapeutic 
composition may comprise a nucleic acid encoding at least a 
portion of a sequence specific DNA binding protein or 
polypeptide linked to a mutated DNA methyltransf erase 
protein or polypeptide. 

25 

The subject receiving the therapeutic composition of this 
invention may be a human or a non- human mammal. 

The therapeutic composition may comprise a replicable 
3 0 expression vector chosen from the group consisting of a pLS 
vector, a prokaryotic expression vector, a yeast expression 
vector, a baculovirus expression vector, a mammalian 
expression vector, and an episomal mammalian expression 
vector. The administration may comprise intralesional , 
35 intraperitoneal, intramuscular or intravenous injection; 
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liposome -mediated delivery; viral infection; gene 
bombardment; topical, nasal, oral, anal, ocular or otic 
delivery. The viral infection may be via a non- integrating, 
replication-defective virus. 

5 

In one preferred embodiment of the method above the nucleic 
acid molecule encoding a chimeric protein is incorporated 
into a liposome to allow for administration to the subject. 
Methods of incorporation of nucleic acid molecules into 

10 liposomes are well known to those of ordinary skill in the 
art. In another embodiment of this method, the nucleic acid 
encoding the chimeric protein may be delivered via 
transfection, injection, or viral infection. There are 
several protocols for human gene therapy which have been 

15 approved for use by the Recombinant DNA Advisory Committee 
{RAC) which conform to a general protocol of target cell 
infection and administration of transfected cells (see for 
example, Blaese, R.M. , et al., 1990; Anderson, W. F. , 1992; 
Culver, K.W. et al., 1991). In addition, U.S. Patent No. 

20 5,399,346 (Anderson, W. F. et al . , issued March 21, 1995) 
describes procedures for retroviral gene transfer. The 
contents of these support references are incorporated in 
their entirety into the subject application. Retroviral - 
mediated gene transfer requires target cells which are 

25 undergoing cell division in order to achieve stable 
integration hence, cells are collected from a subject often 
by removing blood or bone marrow. 

Several methods have been developed over the last decade for 
30 the transduction of genes into mammalian cells for potential 
use in gene therapy. In addition to direct use of plasmid 
DNA to transfer genes, episomal vectors, retroviruses, 
adenoviruses, parvoviruses, and herpesviruses have been used 
(Anderson et al., 1995; Mulligan, 1993; The contents of 
35 which are incorporated in their entirety into the subject 



WO 97/11972 



PCT/US96/15576 



-23- 

application) . For transfer of genes into cells ex vivo and 
subsequent reintroduction into a host, as would be most 
feasible in immunodeficiency patients, retroviruses have 
been the vectors of choice. 

5 

A separately preferred embodiment of the invention is a 
replicable expression vector encoding a chimeric protein 
capable of sequence specific DNA methylation. 

10 The replicable expression vector may be chosen from the 
group consisting of a pLS vector, a prokaryotic expression 
vector, a yeast expression vector, a baculovirus expression 
vector, a mammalian expression vector, and an episomal 
mammalian expression vector. The agent may comprise a 

15 mutated Lex A binding polypeptide linked to a 
methyltransf erase polypeptide. The agent may comprise a 
tridactyl zinc finger polypeptide that specifically binds to 
the HIV-l 5'LTR nucleic acid sequence linked to a 
methyltransf erase polypeptide. 

20 

Another embodiment of the invention is a pharmaceutical 
composition comprising a therapeutically effective amount of 
the replicable vector described hereinabove and a 
pharmaceutically acceptable carrier. The carrier may 
25 comprise a diluent. The pharmaceutically acceptable carrier 
may be an aerosol, intravenous, oral or topical carrier and 
is further described herein. 

Another embodiment of the subject invention is a method of 
30 obtaining an expression vector including DNA encoding a 
desired chimeric protein that includes at least a portion of 
a mutated DNA methyltransf erase protein and at least a 
portion of a DNA binding protein such chimeric protein being 
capable of inhibiting the expression of a target gene which 
35 comprises: (a) obtaining a population of expression vectors 
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each of which expresses a chimeric protein including (1) a 
DNA corresponding to an endogenous promoter sequence of the 
target gene, which promoter contains at least one sequence 
specific methylation site; (2) a DNA encoding a mutated DNA 
5 methyltransf erase; and (3) a DNA encoding a DNA binding 
protein, position adjacent the DNA of (2), DNA (2) and (1) 
being so positioned in the expression vectors as to permit 
expression of the chimeric proteins; (b) introducing the 
population of vectors from step {a) into an appropriate host 

10 under conditions such that the chimeric proteins are 
expressed and methylate DNA; { c) isolating the population 
of vectors from step (b) from the host; (d) treating the 
population of vectors from step ( c) with a suitable 
restriction endonuclease so as to digest the DNA at a 

15 specific site in the promoter if the site is not methylated; 
(e) introducing the population from step (d) into an 
appropriate host, which hosts have the property that they 
degrade DNA which has been non- specif ically methylated at 
sites other than the sequence specific methylation site 

20 contained in the promoter; and (f ) culturing the hosts from 
step (e) under conditions such that vectors which have not 
been degraded express either a mutated or a non-mutated form 
of the chimeric protein, thus obtaining at least one nucleic 
acid encoding a mutated form of a chimeric DNA 

25 methyltransferase/DNA binding protein capable of inhibiting 
the expression of the target gene. 

As used herein, "desired" includes optimal properties of a 
chimeric protein as described hereinabove. The properties 
30 include specifically jnethylating a DNA sequence in a 
promoter sequence of a target gene and inhibiting the 
activity of the promoter and thus inhibiting expression of 
the target gene. 

35 In the practice of any of the methods of the invention or 
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preparation of any of the pharmaceutical compositions an 
"effective amount" is an amount which is effective to 
produce the sequence specific methyltransf erase and 
inactivation of the target gene. Accordingly, the effective 
5 amount will vary with the subject being treated, as well as 
the condition to be treated. For the purposes of this 
invention, the methods of administration are to include, but 
are not limited to, administration cutaneously, 
subcutaneously, intravenously, parenterally , orally, 
10 topically, or by aerosol. 

As used herein, the term "suitable pharmaceutically 
acceptable carrier" encompasses any of the standard 
pharmaceutically accepted carriers, such as phosphate 

15 buffered saline solution, water, emulsions such as an 
oil/water emulsion or a triglyceride emulsion, various types 
of wetting agents, tablets, coated tablets and capsules. An 
example of an acceptable triglyceride emulsion useful in 
intravenous and intraperitoneal administration of the 

20 compounds is the triglyceride emulsion commercially known as 
Intralipid®. 

Typically such carriers contain excipients such as starch, 
milk, sugar, certain types of clay, gelatin, stearic acid, 
25 talc, vegetable fats or oils, gums, glycols, or other known 
excipients. Such carriers may also include flavor and color 
additives or other ingredients. 

Also comprehended by the invention are pharmaceutical 
30 compositions comprising therapeutically effective amounts of 
polypeptide products of the invention together with suitable 
diluents, preservatives, solubilizers , emulsifiers, 
adjuvants and/or carriers. A "therapeutically effective 
amount" as used herein refers to that amount which provides 
35 a therapeutic effect for a given condition and 
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administration regimen. Such compositions are liquids or 
lyophilized or otherwise dried formulations and include 
diluents of various buffer content (e.g., Tris-HCl., 
acetate, phosphate), pH and ionic strength, additives such 
5 as albumin or gelatin to prevent absorption to surfaces, 
detergents (e.g., Tween 20, Tween 80, Pluronic F68, bile 
acid salts). solubilizing agents (e.g., glycerol, 
polyethylene glycerol), anti-oxidants (e.g., ascorbic acid, 
sodium metabisulf ite) , preservatives (e.g., Thimerosal, 

10 benzyl alcohol, parabens) , bulking substances or tonicity 
modifiers (e.g., lactose, mannitol) , covalent attachment of 
polymers such as polyethylene glycol to the protein, 
complexation with metal ions, or incorporation of the 
material into or onto particulate preparations of polymeric 

15 compounds such as polylactic acid, polglycolic acid, 
hydrogels, etc, or onto liposomes, microemulsions, micelles, 
unilamellar or multilamellar vesicles, erythrocyte ghosts, 
or spheroplasts . Such compositions will influence the 
physical state, solubility, stability, rate of in vivo 

20 release, and rate of in vivo clearance. The choice of 
compositions will depend on the physical and chemical 
properties of the protein. For example, a product derived 
from a membrane -bound form of a protein may require a 
formulation containing detergent. Controlled or sustained 

25 release compositions include formulation in lipophilic 
depots (e.g., fatty acids, waxes, oils). Also comprehended 
by the invention are particulate compositions coated with 
polymers (e.g., poloxamers or poloxamines) and coupled to 
antibodies directed against tissue-specific receptors, 

30 ligands or antigens or coupled to ligands of t issuer specific 
receptors. Other embodiments of the compositions of the 
invention incorporate particulate forms protective coatings, 
protease inhibitors or permeation enhancers for various 
routes of administration, including parenteral, pulmonary, 

3 5 nasal and oral. 
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Another embodiment of the subject invention is a transgenic 
non-human mammal whose somatic and germ cells contain and 
express a gene coding for a desired chimeric protein which 
is capable of inhibiting the expression of a target gene, 
5 the gene having been stably introduced into the non-human 
mammal at the single cell stage or an embryonic stage, and 
wherein the gene is linked to a promoter and integrated into 
the genome of the non-human mammal. The desired chimeric 
protein may be the desired chimeric protein described 
10 hereinabove. One skilled in the art would be familiar with 
the experimental methods necessary to produce a transgenic 
mammal, as described in Leder et al . , U. S. Patent No. 
4, 736,866 and Krimpenfort and Berns, U. S. Patent No. 
5,175,384 and Wagner and Chen, U. S. Patent No. 5,175,385. 

15 

If it were possible to stimulate de novo methylation of HIV 
proviral DNA, then clonal inheritance of methylation 
patterns would render the inactivation essentially 
irreversible. As described herein, this irreversible 

20 inactivation is the basis of a therapeutic approach that has 
practical advantages over alternative molecular biological 
approaches (including antisense RNA, ribozymes, targeted 
nucleases, and intracellular anti-HIV antibodies; reviewed 
by Volberding, 1995) . Provirus silencing by targeted 

25 cytosine methylation stimulates an existing host defense 
system and as an analogy, has the virtues of a vaccine: the 
clonal inheritance of methylation patterns is like the 
immunological memory, and the strong suppression of 
transcription is like the immune defense system in that both 

30 perturb the function of particular gene products. 

It is important to point out that the therapeutic agent that 
methylates the target sequence need be present only 
transiently; the new methylation pattern would be propagated 
35 indefinitely by the cellular DNA methylating system (Bestor, 
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1990; Bestor, 1992; Bestor and Coxon, 1993;. Bestor and 
Verdine, 1994) . The alternate molecular biological 
approaches to medical therapies mentioned above, require 
indefinite expression of the agent. This has proven to be 
5 difficult in practice (Challita and Kohn, 1994) and may be 
toxic or may provoke a harmful immune response. An 
additional complication of long-term expression is 
insertional mutagenesis, while inactivation by targeted 
methylation as described herein may involve delivery of the 
10 agent by a nonintegrating, replication-defective virus that 
establishes an abortive infection. None of the alternative 
molecular biological therapies discussed above has these 
advantages . 

15 The 5' LTR of HIV-1 may be inactivated by targeted cytosine 
methylation. As shown in Figure 1 below, the LTR that 
directs transcription of the provirus is rich in binding 
sites for transcription stimulatory proteins (Nabel, 1993; 
Garcia and Gaynor, 1994) and rich in CpG sites, the 

20 predominant site of methylation in vertebrates . It is also 
well-established that methylation of only a few of the CpG 
sites in the 5' LTR represses transcription to nearly 
undetectable levels {Bednarik et al., 1990; Joel et al . , 
1993, reviewed by Bednarik, 1992), but the specific, 

25 critical sites that mediate this effect have not been 
identified. 

Mammalian promoters are silenced by cytosine-5 methylation 
at 5'-CpG-3' dinucleotides . The promoter within the 5' LTR 
30 of HIV-1 may be silenced by directing enzymatic methylation 
to critical CpG sites. Because methylation patterns are 
subject to clonal inheritance, the silencing is essentially 
irreversible . 

35 The subject invention encompasses but is not limited to the 
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following additional embodiments. 

1. Methylation of a predetermined CpG site. In one 
embodiment of the subject invention a sequence-specific DNA 

5 binding protein may be fused to a DNA cytosine 
methyltransf erase, and mutant fusion proteins whose 
methyltransferase activity is dependent on the DNA binding 
protein may be selected from combinatorial libraries. DNA 
binding protein/DNA methyltransferase chimeras that 

10 methylate only those CpG sites immediately adjacent to the 
binding site may be obtained by means of a novel cyclic in 
vivo/in vitro selection protocol. An attenuated version of 
the CpG-specific DNA methyltransferase M.SssI may be 
constructed in this way. In vitro selection requires 

15 resistance to methylation- sensitive restriction endonuclease 
cleavage at the target site, and in vivo selection removes 
plasmid molecules methylated at non-specific sites; this 
second step involves growth in AfcrBC—strains of E. coli, 
which degrade heavily methylated DNA. 

20 

2. Identification of those CpG sites in the HIV-1 5' LTR 
whose methylation produces maximal repression of transcrip- 
tion. The documented inhibitory effects of methylation on 
HIV-1 transcription are likely to be exerted through a 

25 subset of the 11 CpG sites in the 5' LTR of HIV-1. In a 
further embodiment of the subject invention, these critical 
CpG sites may be identified by transcription assays with 
test constructs that bear singly methylated CpG sites; each 
of the 11 CpG sites can be tested individually. The 
- 30 information obtained may be important for the selection of 
target sites in Examples 3-5. 

3. Design, selection, and affinity maturation of zinc 
finger-DNA methyltransferase chimeras that methylate 

35 critical CpG sites in the HIV 5' LTR. In another embodiment 
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of the subject invention, zinc finger modules may be 
catenated in novel combinations to yield proteins that bind 
with high affinity and specificity to predetermined 
sequences. Zinc finger proteins that bind to sites adjacent 
5 to critical CpG sites (identified in Example 2) may be 
selected from combinatorial expression libraries and fused 
to an attenuated CpG-specific DNA methyltransf erase 
(developed in Example 1) . Further rounds of cyclic in 
vivo/In vitro selection (as developed in Example 1) may be 
10 used to obtain variants that methylate only CpG sites 
adjacent to the binding site of the zinc finger moiety. 

4. Inhibition of HIV-1 5' LTR-dependent transcription in 
cultured human cells that express zinc finger -DNA methyl - 

15 transferase fusion proteins of novel and predetermined 
specificity. Example 3 yields constructs that methylate 
critical sites in the HIV-1 5' LTR. In a further embodiment 
of the subject invention, these constructs may be introduced 
into cultured human cells that express a reporter gene 

20 driven by the HIV-1 5' LTR as a means of testing the 
inhibitory effects of targeted de novo methylation. 

5. Inhibition of HIV-1 replication in human T lymphocytes 
productively infected with HIV-1. In another embodiment of 

25 the subject invention, constructs that prove effective in 
the studies of Example 4 may be introduced into HIV-1 
producing human T lymphocytes, and methylation of the target 
sequence and its effect on virus production can be measured 
directly. The methylating construct may be delivered on a 

3 0 plasmid vector or as part of a recombinant adeno- associated 
virus (AAV) . These embodiments suggest a high level of 
usefulness of targeted methylation in the control of HIV-1 
proliferation, and the development of the methylating 
construct into a therapeutic agent. 



35 
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Direction of a DNA methyltransf erase to any CpG site within 
HIV-l regulatory sequences, may result in the silencing of 
the provirus dependent upon the specific zinc finger 
protein/DNA methyltransf erase chimera. Zinc finger motifs 
5 have been shown to be remarkably versatile DNA binding 
structures (Choo and Klug, 1994); they are independent ly- 
folded protein modules that recognize the edges of 3 base 
pairs in the major groove. Over 1,300 zinc fingers have 
been identified, and the binding specificity of a large 

10 number has been determined (Choo et al . , 1995). Several 
laboratories have succeeded in assembling zinc finger 
modules in novel combinations; these designed proteins have 
been subjected to refinement by phage -display selection } and 
variants that bind to novel predetermined sequences with 

15 high specificity and nanomolar affinity have been obtained 
by several laboratories (Choo et al., 1994; Wu et al, 1995; 
Desjarlais and Berg, 1994; Rebar and Pabo, 1994) . These 
small proteins contain tandem arrays of three zinc finger 
modules and bind to 9 bp recognition sequences. Klug and 

20 colleagues were able to show that a zinc finger that was 
targeted to the BCR-ABL breakpoint sequence could bind to 
and suppress transcription of the BCR-ABL gene in cultured 
human leukemic cells (Choo et al., 1994). A zinc finger 
protein designed to bind to a sequence adjacent to a CpG 

25 site in the HIV-l 5' LTR can be fused to an attenuated CpG- 
specific DNA methyltransf erase, thereby achieving efficient, 
selective methylation of that CpG site. A DNA 

methyltransferase with the desired binding properties has 
been cloned from a Spiroplasma species; this enzyme, known 

30 as M.SssI, has the same sequence specificity as the 
mammalian DNA methyltransferase (CpG) (as cloned by Bestor; 
Genbank Accession No. X14805) , but a higher turnover number 
and no requirement for hemimethylated substrates (Renbaum 
and Razin, 1992). The strategy is shown in Figure 3. 

35 
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One embodiment of this invention is the development and 
therapeutic application of zinc finger protein/DNA 
methyltransf erase chimeras or agents whose general 
organization is similar to that of mammalian DNA 
5 methyltransf erase. These agents selectively methylate and 
inactivate the 5' LTR of HIV-1. Such agents have 
considerable promise as therapeutic agents for retroviral 
diseases, and for other applications where selective 
inactivation of a promoter sequence is desired. 

10 

The zinc finger protein/DNA methyltransf erase chimera of 
this invention is similar in design to mammalian DNA 
methyltransf erase, which has been purified (Bestor and 
Ingram, 1983; Bestor and Ingram, 1985a), characterized in 

15 terms of biochemical activities (Bestor and Ingram, 1985; 
Bestor, 1987) , the cDNA that encodes the enzyme cloned 
(Bestor, 1988; Genbank Accession No. X14805; Bestor et al . , 
1988) , and the protein demonstrated to be essential for 
mouse development in a gene disruption study (Li et al., 

20 1992; Li et al . , 1993). As shown below, mammalian DNA 
methyl transferase has an N- terminal regulatory domain and a 
C-terminal catalytic domain that closely resembles bacterial 
restriction methyltransf erases . For a recent review of the 
structure and function of mammalian DNA methyltransf erase, 

25 see Bestor and Verdine (1994) . The agents which are 

embodiments of this invention differ from mammalian DNA 
methyltransf erase in that the regulatory domain targets the 
catalytic domain to predetermined sequences . The regulatory 
. 30 region of mammalian DNA methyltransf erase suppresses the 
methylation of previously unmethylated sites (Bestor, 1992) , 
and directs the protein to sites of new DNA synthesis in S 
(synthesis) phase nuclei (Leonhardt et al . , 1992). The 
targeted methylation which is an embodiment of this 
3 5 invention is grounded in the knowledge of the basic biology 
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of the system that establishes and maintains me thylat ion 
patterns in the mammalian genome . 



The expression construct pCAL7 directs the expression of the 
5 DNA methyltransf erase M.SssI (Renbaum and Razin, 1992) . 
This enzyme in present in Spiroplasma species and has the 
same sequence specificity as mammalian DNA methyltransf erase 
(CpG) , but a higher turnover number and no requirement for 
hemimethylated substrates (Renbaum and Razin, 1992) . The C- 

10 terminal domain of mammalian DNA methyltransf erase also has 
CpG-specific DNA methyltransf erase activity and may also be 
used in practicing this invention/ although k C8t is much 
lower than that of M.SssI (Bestor, 1992; Renbaum and Razin, 
1992) . The plasmid construct described in Example 1 may be 

15 confirmed by restriction analysis and is described in full 
in the following section and depicted in Figure 5. This 
construct embodies (in a simple form) most of the principles 
of the targeted methylation approach described herein. 
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Table 1 shown below, offers data which validates the 
selection against indiscriminate methylation via McrBC 
restriction of methylated DNA (Table 1) . 

5 



£. coli strain 


Input plasmid 




CFU fig DNA" 1 


(Mcr genotype) 


(methylation status) 






ER1381 (McrBC 4 ") 


pSE4 (— ) 


1 


. 1 x 10 9 


ER1793 (McrBC*) 


pSE4 (-) 


1 


. 7 x 10 9 


ER1381 (McrBC*) 


pCAL7 ( + ) 


6 


x 10 5 


ER1793 (McrBCT) 


pCAL7 ( + ) 


8 


x 10 8 



Table 1, Selection against nonspecific methylation in 
McrBC* E. coli. pSE4 is an unmethylated control plasmid; 

15 pCAL7 directs synthesis of M.SssI (Renbaum et al . , 1992) 
under the control of the p tac promoter. Plasmids were 
introduced into the indicated bacterial strains (Raleigh, 
1992; Sutherland et al . , 1992) by electroporation, and 
transformation efficiencies determined by colony counts on 

20 selective media. The values shown are conservative, as IPTG 
induction was not used in this case, and the plasmid was 
only partially methylated as a result of basal expression 
from the p tac promoter; measurements of Hpall resistance 
indicated <50% methylation. IPTG treatment increased the 

25 methylation level and increased restriction from a factor of 
~10 3 to 10 4 . 

Promoter inactivation via targeted cytosine methylation. 

30 A lambda repressor- integrase fusion protein has been shown 
to direct HIV-1 integration to sites in the vicinity of 
lambda operators (Bushman, 1994) . Through an analogous 
mechanism, a chimeric DNA binding protein/DNA 
methyltransferase will cause preferential methylation of 



WO 97/11972 



PCT/US96/15576 



-35- 

sites in the vicinity of the recognition sequence of the DNA 
binding protein through a very large increase in the local 
concentration of DNA methyltransf erase . As shown in Figure 
3, this may be used to selectively methylate and silence 
5 predetermined promoter sequences. 

Crystallography data have shown that DNA cytosine 
methyltransferases make base-specific contacts in the major 
groove, and numerous sequence -independent contacts with the 
10 DNA backbone (Bestor and Verdine, 1994) . A diagram of these 
contacts is shown in Figure 5. 

Mutations of residues involved in the backbone contacts 
lowers the affinity of the enzyme for DNA and reduces the 

15 rate of methyl transfer. However, tethering the weakened 
DNA methyltransferase to a sequence-specific DNA binding 
protein greatly increases the concentration of the DNA 
methyltransferase in the vicinity of the binding site, thus 
offsetting the effects of reduced affinity. This results in 

20 a chimeric protein that methylates sites only in the immedi- 
ate vicinity of the recognition sequence of the DNA binding 
protein. Toxicity due to methylation of collateral CpG 
sites is obviated. Rather than estimating the nature of 
mutations that would yield the desired result, proteins with 

25 the desired properties may be selected from large 
combinatorial libraries. The design and selection of such 
chimeric proteins, and their development into potential 
therapeutic agents, is an embodiment of the subject 
invention. The 5' LTR of HIV-1 may be one target of 

30 inactivation, and a rational, step-wise approach to the this 
is described below in the Experimental Details. 

Gene Inactivation by Targeted Methylation 

35 1. Additional embodiments of methylating agents. 
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As mentioned above, the zinc finger/DNA methyltransf erase 
chimeras discussed and described herein are analogous in 
structure to mammalian DNA methyltransf erase, the enzyme 
that normally establishes and maintains methylation patterns 
5 in the genome. This enzyme has a zinc binding site that has 
been shown to participate in the discrimination of 
unmethylated and hemimethylated CpG sites (Bestor, 1992) . 
Replacement of this zinc-binding region with a tridactyl 
finger region of known binding specif icity * confers a new de 

10 novo specificity on the enzyme. Such an enzyme may be less 
immunogenic, and the efficiency of methylation may be 
increased because the coupling of DNA replication and 
methylation would be restored (Leonhardt et al., 1992). New 
generations of increasingly effective methylating agents may 

15 be developed using this approach. 

2 • Delivery of the methylating agent 

Described herein is a new therapeutic agent. Many delivery 
agents are described and encompass embodiments of this 

20 invention which are technically sufficient to allow delivery 
to the target cell population. The AAV transducing vector 
used in the experiments of Example 5 has many advantages; 
although other transducing vectors with superior properties 
are also encompassed in this invention. Delivery of the 

25 methylating agent to a target cell population has no addi- 
tional difficulties over those associated with other molecu- 
lar biological therapeutics. 

As pointed out earlier, in favorable cases the methylating 
3 0 agent, need be present only transiently, since the new 
methylation pattern is subject to clonal inheritance, and 
gene silencing mediated by this agent is essentially irre- 
versible. This advantage is not enjoyed by alternative 
molecular biological therapeutics. In cases of recurrent 
35 infection or relapse due to silencing of an inadequate 
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fraction of the proviral load, further treatments with the 
original agent or an agent that recognizes a different 
critical CpG site may be undertaken. The highly versatile 
nature of the methylating agents makes such an approach more 
5 favorable than the extant molecular biological therapeutic 
agents* 

3. Prophylaxis in humans and animals 

Targeted methylation may also be applied to prophylaxis and 

10 the prevention of recurrent infections. One promising route 
involves a zinc finger/DNA methyltransf erase chimera that 
resides in the cytoplasm by virtue of the lack of nuclear 
localization signals, and which methylates incoming viral 
DNA during its transit through the cytoplasm. Previous work 

15 has demonstrated that retroviral DNA is accessible to 
nucleases (and therefore to methyltransf erases of similar 
size) while in the cytoplasm (Bowerman et al. f 1989), and 
methylation prior to integration would yield a provirus that 
is transcriptionally silenced from the time of integration. 

2 0 Long-term, stable expression of such a construct could 
protect recurring infections with HIV-1 or other susceptible 
viruses. Successful development of constructs that can 
selectively silence specific viral promoters justifies the 
development of prophylactic derivatives for medical and 

25 veterinary applications and are also included as an 
embodiment of the instant invention. 

4. Applications of targeted methylation to other gene 
inactivation studies 

30 The methylating agents described here are very versatile, 
and can be targeted to other promoters by installation of 
zinc finger moieties of the appropriate sequence 
specificity. The choice of HIV-l as a target in this 
proposal was motivated by the severe threat to public health 

35 posed by this virus rather than by any special 
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vulnerability. Slight modification of the design and 
selection procedures can produce agents that methylate any 
predetermined sequence, and such agents may be useful in any 
case in medicine or experimental biology where it is 
5 desirable to silence a given promoter. 

This invention is illustrated in the Experimental Detail 
section which follows. These sections are set forth to aid 
in an understanding of the invention but are not intended 
10 to, and should not be construed to, limit in any way the 
invention as set forth in the claims which follow 
thereafter. 
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Experimental Details 

In the construction of a nucleic acid vector (see Figure 6), 
LexA was chosen for its proven ability to direct fused 
5 proteins to LexA binding sites (Brent and Ptashne, 1986), 
and M.SssI was chosen because of its high specific activity 
and the fact that it is the only bacterial methyltransferase 
that has the same specificity (5'-CpG-3'; Renbaum and 
Razin, 1992) as the mammalian enzyme (Bestor and Ingram, 
10 1983) . 

Example 1. Targeted de novo methylation. 

The product of the construct represented in Figure 6 yields 
an increased local concentration of DNA methyltransferase in 

15 the vicinity of LexA binding sites. As described above, 
this alone does not ensure targeted methylation, as the 
M.SssI moiety retains intrinsic activity towards all CpG 
sites and substantial methylation of collateral sites is to 
be expected; such indiscriminate methylation is lethal to 

2 0 mammalian cells. It is therefore necessary to make the DNA 
methyltransferase moiety dependent on LexA-mediated DNA 
binding; this may be accomplished by selection of mutant 
versions of M.SssI that have reduced intrinsic DNA binding 
activity. A novel cyclic in vivo/in vitro selection 

25 protocol is used to select mutant proteins of the desired 
character. Resistance to cleavage by a methylation- 
sensitive restriction endonuclease provides in vitro 
selection for methylation of the target site; growth of 
cleavage- resistant pools of plasmid in McrBC strains of E. 

30 coli selects against chimeras that methylate non-target 
sites. (Mcr was named from modified cytosine restriction; 
the system causes the degradation of plasmid DNA that is 
methylated at many positions but has no effect on DNA that 
is methylated at single sites ;Raleigh, 1992) . The selection 

35 scheme is depicted in Figure 7. 
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As mentioned above, the initial construct methylates both 
specific (that is, the Smal target site) and non-specific 
CpG sites elsewhere on the plasmid. it is therefore 
necessary to transfer DNA binding authority from the 
5 catalytic moiety to the DNA binding moiety; this is done by 
selecting for mutations that prevent the methylation of non- 
specific sites while allowing methylation of the specific 
site. 

10 It cannot be predicted as to which mutations might give the 
desired reduction in affinity for DNA, so random mutations 
are introduced and selection is applied to obtain mutants of 
the desired character. The M.SssI moiety is mutagenized by 
Mn++ PCR (PCR amplification in the presence of Mfl causes 

15 misincorporation of nucleotides and therefore transition 
and transversion mutations) from primers just external to 
the BamHI and AscI sites (the primers are depicted as arrows 
in Figure 8) . The PCR product is cleaved with BamHI and 
AscI, the fragment purified by gel electrophoresis and 

20 ligated between the AscI and BamHI sites of pLS (Figure 6) . 
The ligation products are electroporated into JM105 
(McrBC) . Calculations indicate that optimal ligation and 
elect roporat ion conditions allow selection to be applied to 
>10 9 separate clones per screen (J. Hill and T. Bestor, data 

25 not shown) . After 20 min growth in rich liquid medium at 
37°C and 150 min in rich medium supplemented with 50 fig ml" 1 
ampicillin, the cells are washed 3 times and plasmid DNA 
extracted by the alkaline lysis procedure. The DNA is 
treated with 5 units ^g" 1 Smal for 2 hr at 25°C, then 

30 transformed into McrBC+ E. coli strain ER1381. As 
previously described, the McrBC system degrades DNA that is 
methylated at many sites but does not affect DNA methylated 
at single sites (Raleigh, 1992) . It may be necessary to 
perform more than one iteration of the mutagenesis/selection 

35 procedure, but the process is not demanding and each it era- 



WO 97/11972 



PCT/US96/15576 



-41- 

tion of the complete cyclic in vivo/in vitro selection 
method can be completed in a few days. 

An additional selection step may be added to the cyclic 
5 selection procedure if the previous two steps give insuffi- 
cient enrichment for the clones of interest. Plasmid pools 
are extracted from McrBCz bacteria and cleaved with Smal 
together with one of the 6 methylation-sensitive enzymes 
that have unique sites in pLS. If both sites are 

10 methylated, no cleavage occurs. If both' sites are 
unmethylated, both will be cleaved. Only if one site is 
methylated and the other unmethylated will unit-length 
molecules with compatible ends be produced. These unit- 
length linear molecules are isolated by agarose gel 

15 electrophoresis, circularized by treatment with DNA ligase, 
and reintroduced into E, coli by electroporation. Note that 
this selection step, as in the case of the other two, can be 
completed in less than two days. 

20 The procedure described above is designed to produce a 
chimeric protein that methylates a predetermined target site 
adjacent to the binding site of a sequence- specif ic DNA 
binding protein. Candidate clones may be confirmed by 
simple assays. To confirm that Smal resistance is due to 

25 methylation of the target site and not to loss of the site 
by mutation, candidate plasmids are tested for resistance to 
Smal and sensitivity to the methylation- insensitive 
isoschizomer Xmal (both enzymes recognize the sequence 
CCCGGG) . Constructs that pass this test are examined for 

30 non-specific methylation by testing for sensitivity to the 
methylation-sensitive endonucleases BstUl (CGCG) , Hpall 
(CCGG) , Maell (ACGT) , Hhal (GCGC) . Resistance and sensitiv- 
ity is assessed from banding patterns after agarose gel 
electrophoresis. It can be concluded that targeted 

3 5 methylation was achieved upon recovery of plasmids that are 
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resistant to Smal and sensitive to Xmal, BstUl, Hhal, Hpall, 
and Maell. 



If clones of the desired binding and enzymatic specificity 
5 do not emerge from the screen, a likely reason may be 
stearic incompatibility of the DNA binding protein and DNA 
methyl transferase in the orientation shown in Figure 6. New 
constructs may be made in which the order of the moieties 
are reversed, and the selection screen described above may 
10 be done on these new constructs. 



The number of iterations of the mutagenesis/selection 
protocol that is required to produce constructs with the 
required characteristics may be variable. However, the 
procedure is rapid, and multiple iterations are not 
demanding . 



Constructs that have the desired binding and enzymatic 
specificity are sequenced in order to identify mutations 

20 that make DNA methyltransf erase activity dependent on the 
binding energy of the DNA binding fusion partner. The 
nature and position of the mutations are referred to the 3D 
structure of cytosine methyltransf erases M.Hhal 
(Klimasauskas et al. # 1994) and M.Haelll (Reinisch et al., 

25 1995) ; both of these enzymes are closely related to each 
other and to M.SssI, and conservation of tertiary structure 
is very likely. The desired mutations may map to one of the 
many residues that make nonspecific contacts with the DNA 
backbone (reviewed by Bestor and Verdine, 1994) . These 

30 mutations are introduced into the starting constructs of 
Example 3 so as to reduce the amount of selection to obtain 
constructs of the desired character. 



Example 2. Identification of those CpG sites in the HIV-1 5' 
35 LTR whose methylation produces maximal repression of 
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transcription. As described previously, viral transcription 
is strongly repressed by methylation of the HIV-1 5' LTR 
(reviewed by Bednarik, 1992) . It is very likely that this 
effect is exerted through a subset of the 11 CpG sites 
5 present in the 5' LTR of HIV-1 (see Figure 1) . The critical 
CpG sites are identified by measurements of transcription 
rates of test constructs that bear single methylated CpG 
sites. The CpG site that gives the highest degree of 
inhibition when methylated serves as the target for Examples 
10 3 and 4 . 

The approach is related to methods established by 
Busslinger et al. (1983). The Pstl-BamHI insert of pUC- 
BENN-CAT (which contains the HIV-l 5' LTR driving 

15 transcription of the bacterial chloramphenicol 
acetyltransferase (CAT) gene) (Gendelman et al . , 1986) is 
excised and cloned into M13 mpl8. Single -stranded phage DNA 
is prepared, and synthetic oligonucleotide primers that bear 
m 5 C at single specific CpG sites are hybridized to an M13 

20 clone of the sequence of interest. The primer is extended 
with a DNA polymerase; sequenase is the polymerase of choice 
for this invention because of its high processivity and low 
yield of partially extended products. The nick at the 5' 
end of the primer is sealed by incubation with E. coll DNA 

25 ligase and the circular double stranded molecules are 
purified away from single stranded template and 
oligonucleotide primer by agarose gel electrophoresis. The 
circular DNA is purified from gel slices, cleared of 
ethidium bromide by extraction with phenol, collected by 

3 0 ethanol precipitation, and mixed with equimolar amounts of 
an act in promoter- lucif erase expression construct prior to 
transfection into HL2/3 cells (Ciminale et al., 1990). This 
cell line expresses high levels of the HIV-1 proteins Gag, 
Env, Tat, Rev, and Nef proteins and affords a good 

35 simulation of HIV-1 transcription in infected T cells; it is 
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especially important to identify CpG sites that suppress 
transcription when methylated and in the presence of the 
virus-encoded stimulatory factor Tat. The hemimethylated 
sites introduced into the constructs are converted to 
5 symmetrically methylated sites immediately upon transfection 
(Busslinger et al., 1983) . The ratio of CAT activity to lu- 
ciferase activity (both are measured by simple and well-es- 
tablished assays) are taken as the measure of inhibition by 
CpG methylation. Control constructs are either completely 

10 unmethylated or completely methylated (by extension of the 
primer in the presence of dm 5 CTP) . The location of the 11 
CpG sites and the method of synthesis of the methylated 
expression constructs is shown in Figure 5. The plasmid 
pUC-BENN-CAT and the cell line HL2/3 are distributed by the 

15 NIH AIDS Research and Reference Reagent Program, and have 
been obtained from that source. This experiment is 
straightforward, and the results are invaluable for the 
experiments described in Examples 3, 4, and 5. The assays 
are done both in transiently transfected cells, and in cell 

20 clones that have stably integrated the construct; these 
clones are established by G418 selection after unlinked co- 
transformation with pSV2Neo and the pUC-BENN-CAT constructs. 

Example 3. Design, selection, and affinity maturation of 
25 zinc finger-DNA methyl transferase chimeras that methylate 
critical CpG sites in the HIV 5' LTR. There are three 
important factors to be taken into account when selecting a 
site for targeted methylation in the HIV-1 5' LTR. First, 
the site must have the ability to repress transcription when 
3 0 methylated; this is established in the experiments of 
Example 2. Second, the binding site for the zinc finger 
moiety must be adjacent to (rather than within) the 
recognition sequence of a regulatory factor. The factors 
that bind to the 5' LTR are all host -encoded and are 
35 involved in the transcription of many cellular genes; 
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methylation of such sites is expected to cause unacceptable 
toxicity through inactivation of vital cellular genes. 
Third, the recognition sequence is preferably from 6 to 25 
base pairs in length and more preferably at least 9 base 
5 pairs in length, because such sequences occur by chance only 
every 4 9 (or 262,144) base pairs. This rarity ensures that 
target sites do not occur by chance in significant numbers 
of cellular promoters. 

10 Zinc finger proteins that meet the above criteria are 
selected from combinatorial expression libraries by phage 
display, and fused to the CpG-specif ic DNA methyl transferase 
M.SssI as described in Example 1, and depicted in Figure 9. 
Rounds of cyclic in vivo/in vitro selection (also described 

15 in Example 1) are used to obtain variants that methylate 
only CpG sites adjacent to the binding site of the zinc 
finger moiety. As before, more than one iteration of the 
mutagenesis/selection procedure may be required, but the 
process has no difficult or time-consuming steps. Only a 

20 few days are required for each iteration, and large numbers 
of clones (>10 9 per cycle) can be subjected to selection at 
once . 

The method for the selection of tridactyl zinc finger 
25 proteins that bind to predetermined sequences is shown in 
Figure 9 . 

The method depicted in Figure 9 is used to select a 
tridactyl finger protein that binds to a 9 bp sequence adja- 

30 cent to a critical CpG site (identified in Example 2) . The 
spacing between the recognition sequence and the target CpG 
site may be adjusted to conform to the sum of the estimated 
radii of the two globular proteins of masses 30,000 (M.SssI) 
and 20,000 (the tridactyl zinc finger protein). This 

35 spacing is equivalent to roughly 25 base pairs. 
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The selected finger protein is fused to M.SssI 
methyltransferase via a random linker to create a second 
combinatorial expression library. From this point forward 
the protocol is very similar to that described in Example 1. 
5 The construct differs from that shown in Figure 6 only in 
that LexA is replaced with the coding region for the zinc 
finger protein, and the LexA binding site is replaced with 
the HIV-1 LTR sequence used for selection. The c-MYC 
epitope tag is included to allow visualization of the 
10 protein on immunoblots and by immunofluorescence. The 
sequence and spacing of the central CpG in the Smal site and 
the recognition sequence for the zinc finger protein is the 
same as that in the HIV-l LTR target sequence; a minimal 
change that converts the sequences flanking the CpG site to 
15 a Smal site is introduced. This change should not pose any 
problems, as the intrinsic sequence specificity of M.SssI is 
limited to the CpG dinucleotide (Renbaum and Razin, 1993) . 
As was previously done with the LexA construct, cyclic in 
vivo/in vitro selection is used to obtain mutant chimeras in 
20 which the DNA methyltransferase moiety depends on the finger 
protein part of the energy of DNA binding. This obviates 
methylation of collateral CpG sites, which is known to be 
toxic to mammalian cells. As with the LexA construct of 
Example 1, it may be confirmed that Smal resistance is due 
25 to methylation of the target site, and not to loss of the 
site by mutation, by testing for resistance to Smal and 
sensitivity to the methylation- insensitive isoschizomer 
Xmal. Constructs that pass this test are examined for non- 
specific methylation by testing for sensitivity to the 
30 methylation-sensitive endonuclease BstUl (CGCG) , Hpall 
(CCGG), Maell (ACGT) , Hhal (GCGC) . Almost 100 CpG sites 
within the plasmid construct may be tested through use of 
these enzymes. Resistance and sensitivity is assessed from 
banding patterns after endonuclease treatment and agarose 
35 gel electrophoresis. As in Example 1, it may be concluded 
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that targeted methylation has been achieved upon recovery of 
plasmids that are resistant to cleavage by Smal and 
sensitive to Xmal, BstUl, Hhal, Hpall, and Maell. 

5 Alternatively, as with the LexA construct, constructs in 
which the order of the zinc finger and M.SssI moieties has 
been reversed are constructed and subjected to the selection 
protocol described above. As in the case of the pLS 
constructs described in Example 1, the data indicate that it 
10 is possible to screen between 10 9 and 10 10 independent clones 
at once, and each iteration of the selection protocol 
requires only a few days. 

Example 4. Inhibition of HIV-l 5' LTR -dependent 

15 transcription in cultured human cells that express zinc 
finger/DNA methyl transferase fusion proteins of novel and 
predetermined specificity* Example 3 yields constructs that 
methylate critical sites in the HIV-l 5' LTR. These 
constructs are introduced into cultured human cells that 
20 express a reporter gene from the HIV-l 5' LTR as a means of 
testing the inhibitory effects of targeted de novo methy- 
lation. 

The plasmid pUC-BENN-CAT {Gendelman et al., 1986) is 
25 introduced into HL2/3 cells (Ciminale, 1990) together with 
unlinked pSV2Neo, and stable transf ectants selected with 
G418. The HL2/3 cell line expresses most of the HIV-l 
accessory proteins (including Tat) from a replication- 
defective HIV-l provirus (Ciminale et al., 1990). The 
30 presence of HIV-l viral accessory proteins renders this 
assay a good simulation of HIV-l transcription in infected 
cells. Transf ectant clones that express the CAT gene from 
the HIV-l LTR are identified by RNA blot hybridization. 
Such cell clones are referred to as HL2/3 -LTR-CAT . 

35 
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HL2/3-LTR-CAT cells are transfected by the calcium phosphate 
technique with a mammalian expression vector that directs 
production of the zinc f inger-M. SssI fusion protein (Example 
3) from the cytomegalovirus (CMV) immediate early enhancer- 
5 promoter. The starting vector is pEVRFO (Matthias et al . , 
1990) , which contains the CMV promoter, a beta globin intron 
in the 3' untranslated region for enhanced transcript 
stability, and a polylinker region for convenient cloning. 
We have used this vector with good success in many 

10 constructs (Czank et al., 1991; Leonhardt et al., 1992). 
This expression construct is termed pCMV-ZMet. A pSV2Hyg 
resistance marker is co-transf ected with pCMV-ZMet, and the 
cells subjected to double selection with G418 and 
Hygromycin. Production of zinc f inger/M. SssI fusion protein 

15 are assayed by immunoblot analysis; as with the LexA-M.SssI 
fusion protein of Figure 3, an N-terminal c-MYC epitope tag 
is present, and the protein is visualized with the 9E10 
antibody. 

20 Immunofluorescence microscopy is used to determine whether 
the methylating agent can gain access to nuclei. If it is 
found to be confined to the cytoplasm, the • nuclear 
localization signal of SV40 large T antigen may be added to 
the N- terminus, as we have done for a class of fusion pro- 

25 teins to induce their translocation into nuclei (Leonhardt 
et al. , 1992) . 

Cells that express zinc f inger/M. SssI fusion protein are 
examined for loss of CAT expression by CAT enzyme assay and 

3 0 by RNA blot hybridization. In the event that the most 
critical CpG site happens to lie within one of the four 
sites for methylation-sensitive restriction endonuclease 
(four of the eleven sites can be assayed in this way; see 
Figure 1) , methylation may be assayed by cleavage of DNA 

3 5 with that enzyme followed by DNA blot hybridization. This 
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is a simple and long-established technique for determination 
of the methylation status of individual CpG sites. If the 
critical site chosen for targeted methylation does not lie 
within such a restriction site, the genomic sequencing 
5 method of Grigg and Clark. (1994) may be used to identify 
methylated cytosines. The method of Collins and Myers 
(1987) can also be used to detect methylation differences. 
This method exploits the increased melting temperature of 
methylated DNA, which allows methylated and unmethylated 
10 fragments to be resolved by denaturing gradient gel 
electrophoresis (Collins and Myers, 1987) . Both methods are 
well-established and both will allow unambiguous 
localization of methylated sites in HIV-1 LTR sequences. 

15 The embodiments described herein of this invention indicate 
that expression of a finger protein/M. SssI chimera targeted 
to a critical CpG site in the HIV-1 LTR methylates and 
inactivates expression of a reporter gene that is driven by 
the LTR. 

20 

Example 5. I n h ibition of HIV-1 replication in T lymphocytes 
productively infected with HIV-1. Jurkat E6-1 cells are 
electroporated with infectious molecular clones of HIV-1 
strain NL4-3, and a productive infection established. The 

25 HIV-producing cells are transfected with the finger 
protein/M. SssI construct described in the previous section, 
together with pSV2Neo as a selectable marker. A vector con- 
trol and a finger protein-only control may be used. This 
latter control is necessary to distinguish inhibition via 

30 methylation from that resulting from direct binding of the 
finger protein to its target site, as has happened in one 
other case (Choo et al., 1995). The finger protein/M. SssI 
chimera is designed to cause de novo methylation and 
transcriptional repression of HIV-1 proviral DNA. 

35 Methylation is assayed as described in the previous section, 
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and viral transcription is assayed by RNA blot hy- 
bridization. 

The plasmid-mediated system test described above is 
5 convenient and reliable but does not simulate conditions 
that may be encountered in a therapeutic application. A 
vector that addresses the needs of therapeutic applications 
is therefore used and described below. One of the most 
promising viral vectors is based on adeno-associated virus - 
10 (AAV) , and the methylating agent may be delivered through 
use of recombinant AAV. 



AAV has several advantages as a transducing vector: I. AAV 
is not associated with any disease, ii. The viral DNA is 

15 thought to integrate at a unique site on chromosome 19q, 
with little risk of insertional mutagenesis, iii. it has 
been reported that AAV can infect non-replicating cells 
(reviewed in Berns and Linden, 1995) , iv. AAV can infect 
lymphoid cells (Mendelson et al . , 1992), and transcription 

20 units borne by recombinant AAV strains are expressed in T 
lymphocytes (Chatterjee et al., 1992), and v. Virulent 
strains of AAV cannot arise by recombination within infected 
cells, as can occur with retroviral vectors. 



25 Introduction of methylating agents into transducing AAV 
vectors is straightforward. The finger protein/DNA 
methyltransf erase coding region under control of the CMV 
immediate early enhancer -promoter is amplified by PCR with 
primers that introduce Xbal sites at both ends of the 

30 construct. This expression cassette is ligated between the 
Xbal sites of pAVl, an infectious molecular clone of AAV 
(Banerjee et al., 1992). The resulting clones are selected 
and propagated in E. coli, and confirmed by direct 
sequencing; clones of the desired specificity are termed 

35 pAV-ZMet. Control clones that express only the zinc finger 
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protein are constructed and termed pAV-Zif; clones that 
express only the mutant M.SssI are constructed and termed 
pAV-Met. Infectious recombinant virus stocks are obtained 
by co-transfection of adenovirus -infected HeLa cells with 
5 pAV-ZMet, pAV-Zif, or pAV-Met, together with pTAAV, a 
plasmid that encodes proteins required for replication and 
packaging (Banerjee et al., 1992). The titer of virus 
stocks obtained in this manner are commonly 10 7 ml 1 
(Banerjee et al., 1992); such viruses are also capable of 
10 infecting CD4* T lymphocytes such as Jurkat cell with high 
efficiency (Mendelson et al., 1992). The recombinant 
viruses are termed AAV-ZMet , AAV-Zif, and AAV-Met, in 
keeping with the terminology of their plasmid precursors. 

15 Jurkat cells productively infected with HIV-1 strain NL.4-3 
(as described above) are infected with AAV-ZMet, AAV-Zif, or 
AAV-Met at an MOI that infects >90% of the cell population, 
as established by immunofluorescence microscopy with 
antibodies against the c-MYC epitope encoded by the fusion 

20 proteins. Jurkat cells are highly susceptible to infection 
by AAV (Mendelson et al., 1992) RNA blot hybridization is 
used to evaluate the effect of expression on transcript 
accumulation, as has been done in antisense inhibition 
experiments that utilized AAV expression vectors (Banerjee 

25 et al., 1992). The methylation status of integrated HIV-1 
proviral DNA may be evaluated as described in Example 3 . 
Comparison of transcript levels, cell survival, and proviral 
methylation among cultures infected with vector control, 
AAV-ZMet, AAV-Zif, and AAV-Met can give a clear indication 

3 0 of the efficacy of targeted methylation in suppressing HIV-1 
expression under conditions that approximate those to be 
encountered in actual therapeutic applications. 

Example 6. Design, selection, and affinity maturation of 
35 zinc finger-DNA methyl transferase chimeras that methylate 
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critical CpG sites in a promoter of the hepatitis B virus. 

Chronic hepatitis due to infection with Hepatitis B virus 
(HBV) is a major cause of cirrhosis of the liver and 
hepatocellular carcinoma (reviewed by Ganem and Varmus, 
1987). Approximately 1.5% of the population is infected 
with HBV (reviewed by Saracco and Rezzetto, 1995) and the 
worldwide morbidity and mortality associated with chronic 
HBV infection is far greater than that due to HIV-l 
infection. As shown in Figure 10, the HBV genome is rich in 
CpG sites (there are 103 in a genome of only 3.2 kb; Ono et 
al., 1983) and transcription from HBV promoters is sensitive 
to CpG methylation (Miller and Robinson, 1983; Pourcel et 
al., 1990/ The contents of these references are incorporated 
15 in their entirety into the subject application) . HBV 
proviral DNA is therefore susceptible to inactivation by 
targeted methylation according to the methods elaborated for 
HIV-l inactivation herein. 

20 Identification of critical CpG sites, design of chimeric 
zinc finger protein/DNA methyltransf erases, and cyclic in 
vivo/in vitro selection of chimeras that methylate only the 
target CpG site is as described for the HIV-l targets in the 
above examples. 

25 

One embodiment of the subject invention is to target the 
known promoters of the hepatitis B virus and inactivate the 
viral machinery via methylation. The site for targeted 
methylation in the hepatitis B promoter is selected as 

30 follows. First, the site will have the ability to repress 
transcription when methylated; this is established in the 
experiments of Example 2 above. Second, the binding site 
for the zinc finger moiety will be adjacent to (rather than 
within) the recognition sequence of a regulatory factor. 

3 5 The factors that bind to the promoter of the viral X protein 
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or the pre-S2/5 promoter are host -encoded and are involved 
in the transcription of many cellular genes; methylation of 
such sites is expected to cause unacceptable toxicity 
through inactivation of vital cellular genes. Third, the 
5 recognition sequence is optimally from 6 to 25 base pairs in 
length and more preferably at least 9 base pairs in length. 

Zinc finger proteins that meet the above criteria are 
selected from combinatorial expression libraries by phage 

10 display, and fused to the CpG-specific DNA methyltransf erase 
M.SssI as described in Example 1, and depicted in Figure 9. 
Rounds of cyclic in vivo/in vitro selection (also described 
in Example 1) are used to obtain variants that methylate 
only CpG sites adjacent to the binding site of the zinc 

15 finger moiety. As before, more than one iteration of the 
mutagenesis/selection procedure may be required, but the 
process has no difficult or time-consuming steps. Only a 
few days are required for each iteration, and large numbers 
of clones (>10 9 per cycle) can be subjected to selection at 

20 once. 

The method for the selection of tridactyl zinc finger 
proteins that bind to predetermined sequences is shown in 
Figure 9 . 

25 

The method depicted in Figure 9 is used to select a 
tridactyl finger protein that binds to a 9 bp sequence adja- 
cent to a critical CpG site in either the promoter of viral 
protein X or the pre-S2/5 promoter of hepatitis B. The 
30 spacing between the recognition sequence and the target CpG 
site may be adjusted to conform to the sum of the estimated 
radii of the two globular proteins of masses 30,000 (M.SssI) 
and 20,000 (the tridactyl zinc finger protein). This 
spacing is equivalent to roughly 25 base pairs. 

35 
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Figure 11 A and Figure 11B illustrate targeted methylation 
of a preselected CpG site in a pLS vector which encodes a 
lexA-Af.SssI fusion protein. This data proves that a CpG- 
specific DNA methyltransf erase can be directed to a target 
site that is adjacent to the binding site of a sequence- 
specific DNA binding protein. This targeted methylation was 
achieved in living bacterial cells. Mutagenesis and cyclic 
in vivo/in vitro selection is being applied so as to obtain 
a chimeric protein that will methylate the target site but 
have no effect on collateral CpG sites. The data shown 
above and specifically in Figures 11A-11B, demonstrate that 
the principle of targeted methylation has been reduced to 
practice in the laboratory. 

15 Example 7: Targeted Methylation in the Repression of HIV-1 
Replication 

A. Construction of chimeric DNA methyltransf erase of novel 
and predetermined specificity. 

20 

The Lac repressor was fused to the N- terminus of M.SssI by 
recombinant DNA technology and the chimeric protein was 
expressed in E. coli. The construct used in these 
experiments is shown in detail in Figure 12. The construct 

25 also contained a synthetic sequence that included the Lac 
repressor binding site and a cluster of CpG dinucleotides . 
This construct was used to establish three important points: 
First, that chimeric DNA methyltransf erase could be produced 
in bacterial cells in a stable and soluble form; second, 

30 that the chimeric DNA methyltransf erase would selectively 
methylate CpG sites in the vicinity of the recognition 
sequence of the lac repressor, and third, the optimal 
spacing between recognition sequence and target CpG site is 
20-30 nucleotides. 
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The construct shovm in Figure 12 was electroporated into E. 
coli and synthesis of fusion protein induced by addition of 
isopropylthiogalactoside to culture medium. After 3 hours 
of incubation cells were lysed, and protein extracts were 
5 subjected to SDS gel electrophoresis and immunoblot analysis 
with an anti-Lac repressor antibody. As shown in Figure 13, 
the chimeric DNA methyltransf erase was soluble and resistant 
to proteolysis as shown by its appearance as a single band 
of the appropriate size. There was little sign of a 
10 reduction in growth rate upon induction of synthesis of the 
chimeric DNA methyltransf erase . This confirmed that the 
chimeric DNA methyltransf erase was not toxic to the host 
cells . 

15 Plasmid DNA was purified from the same cells and high 
resolution mapping of methylated sites was carried out by 
the bisulfate genomic sequencing method. Bisulfite converts 
cytosine to uracil by oxidative deamination. 5- 
methylcytosine is resistant to bisulfite attack. Upon PCR 

20 amplification all cytosine will be copied as thymidines but 
5-methylcytosine will be preserved as cytosines. As shown 
in Figure 14, plasmid DNA exposed in vivo to Lac repressor - 
M.SssI fusion proteins underwent methylation at CpG sites 
near the Lac repressor binding site; vector DNA was not 

25 methylated, and plasmid exposed to M.SssI became methylated 
at all CpG sites. These data confirm that fusing a DNA 
methyltransferase to a sequence-specific DNA binding protein 
can limit methylation to CpG sites in the vicinity of the 
recognition sequence of the DNA binding protein as 

30 originally proposed. 



Figure 14 also shows that targeted methylation occurred 20- 
30 nucleotides away from the 3' border of the Lac repressor 
binding site. This is taken to reflect steric interference 
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between protein moieties. The result was expected and is 
not a limitation of the method. 

The Lac repressor-M . SssI fusion proteins confirmed the 
5 principle of targeted methylation. Figure 15 depicts a new 
class of chimeric DNA methyltransf erases that have been 
constructed and confirmed by DNA sequencing. The sequence- 
specific DNA binding moiety is a three-finger zinc finger 
protein (Zif268) of known binding specificity. These agents 
10 are being subjected to the same characterization as the Lac 
repressor-M. SssI fusions. They have also been cloned into 
mammalian expression vectors and experiments in cultured 
primate cells are being performed. 

15 In summary, chimeric DNA methyltransf erase -DNA binding 
proteins have been shown to selectively methylated CpG sites 
in the vicinity of the recognition sequence of the DNA 
binding protein moiety. 

20 B. Inactivation of HIV-1 by CpG me thy la t ion. 

As shown in Figure 16, the LTR of HIV-1 has been cloned 
upstream of two reporter genes, chloramphenicol 
acetyltransferase (CAT) and firefly luciferase (Luc) . Both 
constructs drive high-level expression of the CAT of Luc 
25 reporter genes when transfected into tat -expressing HL2/3 or 
HLTAT human cervical carcinoma cells. 

Methylation is known to suppress transcription from most 
promoters but its effect on HIV-1 LTR-driven transcription 

30 has not been documented in a system that resembles the 
infected cells. Figure 17 shows that CpG methylation of 9 
CpG sites in the HIV-1 LTR reduces transcription to near 
background levels; the effect is at least 20 fold. The 
effect appears to be irreversible and does not require 

35 continued application or expression of any foreign agents. 
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CpG methylation can therefore be regarded as the most 
powerful way to suppress the activity of HIV-l. Work is 
underway that will identify the CpG sites that mediate the 
biological effects of methylation. 

5 

Work as described herein, has shown that M.SssI does not 
methylate single stranded DNA. This has led to the 
development of a new method for the region- specif ic 
methylation of a DNA fragment. A single- stranded 

10 oligonucleotide or longer fragment synthesized by asymmetric 
PCR is annealed to a single -stranded circular DNA and 
treated with M.SssI. CpG sites in the double stranded 
region of the plasmid are methylated while all single 
stranded regions are unmodified. This new method reduces 

15 the time required to identify critical CpG sites. The 
regional methylation method is being used to evaluate the 
effects of CpG methylation on the HIV-l LTR versus the gag 
leader sequence; this latter sequence is rich and has 
strongly conserved CpG sites (Figure 16) and methylation of 

2 0 the sites may suppress transcription of HIV-l as well or 
better than methylation of LTR sequences. 
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TGGAAGGGCT AATTCACTCC CAACGAAGAC AAGATATCCT TGATCTGTGG ATCTACCACA 60 

CACAAGGCTA CTTCCCTGAT TAGCAGAACT ACACACCAGG GCCAGGAGTC AGATATCCAC 120 

TGACCTTTGG ATGGTGCTAC AAGCTAGTAC CAGTTGAGCC AGATAAGGTA GAAGAGGCCA 180 

ACAAAGGAGA GAACACCAGC TTGTTACACC CTGTGAGCCT GCATGGGATG GATGACCCGG 240 

ATATATAAGT GTTAGAGTGG AGGTTTGACA GCCGCCTAGC ATTTCATCAC GTGGCCCGAG 300 

AGCTGCATCC GGAGTACTTC AAGAACTGCT GATATCGAGC TTGCTACAAG GGACTTTCCG 360 

CTGGGGACTT TCCAGGGAGG CGTGGCCTGG GCGGGACTGG GGAGTGGCGA GCCCTCAGAC 420 
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(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 40 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS : single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE : DNA (genomic) 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO:2: 
CCCCCCTGTG AGCCTGCGGG TTTTTTCCCG CAGGCTCACA 
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What is claimed is: 

1. A chimeric protein which comprises a mutated DNA 
methyl transferase portion and a DNA binding protein 
portion that binds sufficiently close to a promoter 
sequence of a target gene, which promoter sequence 
contains a methylation site, to specifically methylate 
the site and inhibit activity of the promoter and thus 
inhibit expression of the target gene. 

2. The protein of claim 1, wherein the promoter sequence 
of the target gene is a 5' long terminal repeat sequence 
of a human immunodeficiency virus-1 proviral DNA. 

3. The protein of claim 1, wherein the target gene 
comprises a retroviral gene, an adenoviral gene, a foamy 
viral gene, a parvoviral gene, a foreign gene expressed 
in a cell, an overexpressed gene, or a misexpressed 
gene. 

4. The protein of claim 1, wherein the chimeric protein 
comprises a zinc three -finger DNA binding polypeptide 
linked to a CpG-specific DNA methyltransf erase 
polypeptide. 

5. The protein of claim 1, wherein the chimeric protein 
comprises a mutated Lex A binding polypeptide linked to 
a cytosine methyltransf erase polypeptide. 

6. The method of claim 1, wherein the mutated DNA 
methyltransf erase portion comprises at least a portion 
of a mutated M.SssI DNA methyltransf erase protein or at 
least a portion of a mutated mammalian DNA 
methyltransf erase protein. 
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7. An expression vector which encodes the chimeric protein 
of claim 1. 

8. The vector of claim 7, wherein the expression vector is 
replicable . 

9. The vector of claim 7, wherein the vector is a pLS 
vector . 

10. The vector of claim 7, wherein the vector is a 
prokaryotic expression vector, a yeast expression 
vector, a baculovirus expression vector, a 
mammalian expression vector, or an episomal 
mammalian expression vector. 

11. A method for inhibiting expression of a target gene 
which comprises contacting a promoter of the target 
gene with the chimeric protein of claim 1 so as to 
specifically methylate the promoter thus inhibiting 
expression of the target gene. 

12. The method of claim 11, wherein the target gene is 
an endogenous target gene. 

13. The method of claim 11, wherein the target gene is 
a foreign target gene. 

14. The method of claim 13, wherein the foreign target 
gene is a retroviral gene or a viral gene. 

15. The method of claim 11, wherein the target gene is 
associated with a cancer, a central nervous system 
disorder, a blood disorder, a metabolic disorder, 
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a cardiovascular disorder, an autoimmune disorder, 
or an inflammatory disorder. 

16. The method of claim 15, wherein the cancer is acute 
lymphocytic leukemia, acute myelogenous leukemia, 
B-cell lymphoma, lung cancer, breast cancer, 
ovarian cancer, prostate cancer, lymphoma, 
Hodgkin's disease, malignant melanoma, 
neuroblastoma, renal cell carcinoma or squamous 
cell carcinoma. 

17. The method of claim 15, wherein the central nervous 
system disorder is Alzheimer's disease, Down's 
syndrome, Parkinson's disease, Huntington's 
disease, schizophrenia, or multiple sclerosis. 

18. The method of claim 15, wherein the infectious 
disease is cytomegalovirus, herpes simplex virus, 
human immunodeficiency virus, AIDS, papillomavirus, 
influenza, Candida albicans, mycobacteria, septic 
shock, or associated with a gram negative bacteria. 

19. The method of claim 15, wherein the blood disorder 
is anemia, hemoglobinopathies, sickle cell anemia, 
or hemophilia. 

20. The method of claim 15, wherein the cardiovascular 
disorder is familial hypercholesterolemia, 
atherosclerosis, or renin/angiotensin control 
disorder. 

21. The method of claim 15, wherein the metabolic 
disorder is ADA, deficient SCID, diabetes, cystic 
fibrosis, Gaucher' s disease, galactosemia, growth 
hormone deficiency, inherited emphysema, Lesch- 
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Nyhan disease, liver failure, muscular dystrophy, 
phenylketonuria, or Tay-Sachs disease. 

22. The method of claim 15, wherein the autoimmune 
disorder is arthritis, psoriasis, HIV, or atopic 
dermatitis . 

23. The method of claim 15, wherein the inflammatory 
disorder is acute pancreatitis, irritable bowel 
syndrome, Chrone's disease or an allergic disorder. 

24. The method of claim 11, wherein the target gene is 
in a cell. 

25. The method of claim 24, wherein the cell is a 
eukaryotic cell, a bacterial cell, an animal cell, 
a plant cell, a prokaryotic cell, a virus packaging 
cell, a somatic cell, a germ cell, a neuronal cell, 
a myocyte, a T lymphocyte, a CD4* cell, a tumor 
cell, a CD4+ cell, or a stem cell. 

26. The method of claim 11, wherein the contacting is 

by means of liposome mediated delivery, retroviral 
delivery, gene bombardment, electroporation or 
cat ionic precipitation. 

27 • A method for inhibiting expression of a target gene 

in a multicellular organism which comprises 
contacting a promoter sequence of the target gene 
with the chimeric protein of claim 1, so as to 
specifically methylate the promoter sequence and 
thus inhibit expression of the target gene in the 
multicellular organism. 
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28. The method of claim 27, wherein the multicellular 
organism is a plant, an animal or a human. 

29. The method of claim 28, wherein the plant is an 
alfalfa plant, a broccoli plant, a rapeseed plant, 
a carrot plant, a chicory plant, a coffee plant, a 
cucurbita plant, a euromelon plant, a potato plant, 
a raspberry plant, a sunflower plant, a tomato 
plant , or a wheat plant . 

30. The method of claim 28, wherein the animal is a 
horse, a primate, a porcine animal, a bovine 
animal, a swine, a fowl, or a fish. 

31. The method of claim 27, wherein the chimeric 
protein or a nucleic acid encoding the chimeric 
protein is delivered to the multicellular organism 
via intralesional, intraperitoneal, intramuscular 
or intravenous injection; liposome-mediated 
delivery; viral infection; gene bombardment; 
topical, nasal, oral, anal, ocular or otic 
delivery. 

32. The method of claim 31, wherein the viral infection 
is via a non- integrating, replication-defective 
virus . 

33. The method of claim 32, wherein the virus comprises 
a replication-defective Human Immunodeficiency Type 
1 provirus, a retroviral vector, an adeno- 
associated virus, a LNL6 vector, a LXSN vector or 
a MMuLV retroviral vector. 

34. A method of treating a subject infected with a 
virus which comprises administering to the subject 
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a therapeutic composition comprising the chimeric 
protein of claim l, or a nucleic acid molecule 
encoding the chimeric protein of claim 1, under 
suitable conditions so as to specifically methylate 
the viral promoter sequence and inhibit expression 
of the viral gene thus treating the subject 
infected with the virus. 

The method of claim 34, wherein the virus is chosen 
from the group consisting of a DNA virus, a 
retrovirus, a herpes virus, an immunodeficiency 
virus, an adeno- associated virus and an adenovirus. 

The method of claim 34, wherein the therapeutic 
composition comprises a nucleic acid molecule 
encoding a mutated Lex A DNA binding protein 
portion linked to a mutated DNA methyltransf erase 
protein portion. 

The method of claim 34, wherein the therapeutic 
composition comprises a nucleic acid molecule 
encoding a tridactyl zinc finger DNA binding 
protein portion capable of specifically binding the 
Human Immunodeficiency Virus Type 1 5 'long terminal 
repeat nucleic acid sequence linked to a mutated 
DNA methyltransf erase protein portion. 

The method of claim 34, wherein the subject is a 
human . 

The method of claim 34, wherein the therapeutic 
composition comprises a replicable expression 
vector chosen from the group consisting of a pLS 
vector, a prokaryotic expression vector, a yeast 
expression vector, a baculovirus expression vector, 
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a mammalian expression vector, and an episomal 
mammalian expression vector, 

40. The method of claim 34, wherein the administration 
comprises intralesional , intraperitoneal , 
intramuscular or intravenous injection; liposome - 
mediated delivery; viral infection; gene 
bombardment; topical, nasal, oral, anal, ocular or 
otic delivery. 

41. The method of claim 40, wherein the viral infection 
is via a non- integrating, replication-defective 
virus . 

42. A host cell comprising the expression vector of 
claim 7. 

43. The host cell of claim 42, wherein the host cell is 
chosen from the group consisting of a eukaryotic 
cell, a somatic cell, a germ cell, a neuronal cell, 
a myocyte, a T lymphocyte, a prokaryotic cell, a 
virus packaging cell, a plant cell, a prokaryotic 
cell, a tumor cell, a stem cell and a CD4+ cell. 

44. A pharmaceutical composition comprising a 
therapeutically effective amount of the expression 
vector of claim 7 and a pharmaceutically acceptable 
carrier. 

45. The pharmaceutical composition of claim 44, wherein 
the carrier comprises a diluent. 

46. The pharmaceutical composition of claim 44, wherein 
the pharmaceutically acceptable carrier is an 
aerosol, intravenous, oral or topical carrier. 
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A transgenic non-human mammal whose somatic and 
germ cells contain and express a DNA coding for a 
chimeric protein of claim 1, the DNA having been 
stably introduced into the non- human mammal at the 
single cell stage or an embryonic stage, and 
wherein the DNA is linked to a promoter and 
integrated into the genome of the non-human mammal. 
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FIGURE 5 
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FIGURE 6 



pLS, An Expression Construct to be Used in the Selection of 
DNA Binding Protein/DNA Methyltransferase Chimeras 
That Methylate Predetermined Sequences 
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FIGURE 9 

Phage-display selection of Zinc-Finger Proteins that Bind to 
Predetermined Sequences in the HIV-1 5' LTR 
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FIGURE 10 

CpG Sites and Virol Transcripts in Hepatitis B Virus 
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FIGURE 11B 
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FIGURE 12 
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FIGURE 13 




LacI-M.SssI Fusion 
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FIGURE 14 
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Background of the Invention 

Throughout this application, various publications are 
15 referenced by author and date. Full citations for these 
publications may be found listed alphabetically at the end 
of the specification immediately preceding the claims. The 
disclosures of these publications in their . entireties are 
hereby incorporated by reference into this application in 
20 order to more fully describe the state of the art as known 
to those skilled therein as of the date of the invention 
described and claimed herein. 

All mammalian promoters tested to date have been found to be 
25 silenced when they contain 5-methylcytosine (m 5 C) at CpG 
sites (reviewed by Bestor, 1990; Meehan et al . , 1993). 
Methylation represses transcription directly by interference 
with binding of transcription factors (Joel et al . , 1993, 
and references therein) and methylated sequences are 
3 0 assembled into condensed chromatin that is inaccessible to 
transcription factors (reviewed by Bird, 1992) . It has been 
shown that methylation of CpG dinucleotides in the 5' long 
terminal repeat (LTR) of HIV-1 suppresses viral 
transcription (reviewed by Bednarik, 1992) . Methylation can 
35 reduce transcription from the HIV-l 5' LTR to undetectable 
levels (Joel et al., 1993). It is also known that 
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methylation patterns are transmitted by clonal inheritance 
(Wigler, 1981) , and that this is due to the strong 
preference of mammalian DNA methyltransf erase for 
hemimethylated DNA (Bestor and Ingram, 1983; Bestor, 1992) . 
5 The heritability of methylation patterns causes the affected 
promoters to be irreversibly inactivated, as has been 
observed for a large number of genes in cultured cells 
(reviewed by Holliday, 1993) and in human tissues (Yoshiura 
et al., 1995; Herman et al . , 1994). While the exact 
10 function of DNA methylation remains the subject of 
discussion, its ability to suppress transcription is not in 
doubt . 

Most exogenous nucleic acid sequences that a resident in a 

15 cell such as transposable elements and proviral DNA are 
methylated and inactivated in the genomes of mammals, 
flowering plants, and those fungi whose genomes contain m 5 C. 
Retroviral DNA is especially prone to inactivation by de 
novo methylation upon germline transmission, and this has 

20 greatly reduced the usefulness of retroviral transducing 
.vectors in the construction of transgenic animals (Jahner 
and Jaenisch, 1984; Jaenisch et al., 1985). Treatment of 
cultured cells or mice with the demethylating drug 5- 
azacytidine (an irreversible inhibitor of DNA 

25 methyltransf erases) can reactivate methylated retrovirus 
genomes (Jaenisch et al., 1985), and endogenous genes that 
have been silenced by ectopic de novo methylation of 
promoter regions can also be reactivated by demethylation 
(Holliday, 1993; Yoshiura et al., 1995). It is notable that 

3 0 organisms whose DNA lacks m 5 C (such as Drosophila) suffer 
far larger numbers of insertion mutations (Ashburner, 1992) , 
which may reflect a reduced capacity for the control of 
mobile elements. These observations, together with 
evolutionary considerations (Bestor, 1990) , strongly suggest 

35 that cytosine methylation is part of a genomic host defense 
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system that limits the proliferation of parasitic sequence 
elements (Bestor, 1990; Bestor and Coxon, 1993) . The 
selective advantage of such a defensive system is obvious, 
given that a significant fraction of the genome represents 
5 exogenous sequences that are invisible to the immune system 
and which might impose a lethal mutagenic or cytotoxic 
burden if allowed to proliferate unchecked. 

The proviral DNA of many retroviruses is propagated in the 
10 repressed, latent state as a result of methylation of LTR 
sequences (reviewed by Jahner and Jaenisch, 1984) . A body 
of evidence indicates methylation can cause latency in cells 
infected with HTLV-1 (Cassens and Ullrich, 1993) or HIV-1 
(Bednarik et al. f 1990; Joel et al., 1993 reviewed by 
15 Bednarik, 1992) . The importance of methylation in latent 
HIV-1 infections in patients is unknown, but as HIV has only 
recently entered human populations, the host -parasite 
relationship may be far from equilibrium (Baltimore, 1995) 
and HIV proviral DNA may be inactivated with low efficiency 
2 0 under normal conditions (McCune, 1995) . It is clear, 
however, that HIV-1 transcription is very sensitive to 
methylation of 5' LTR sequences (reviewed by Bednarik, 
1992) . 



25 
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Summary of the Invention 

The present invention provides a chimeric protein which 
5 comprises a mutated DNA methyltransf erase portion and a DNA 
binding protein portion that binds sufficiently close to a 
promoter sequence of a target gene, which promoter sequence 
contains a methylation site, to specifically methylate the 
site and inhibit activity of the promoter and thus inhibit 

10 expression of the target gene. This invention also provides 
for a method for inhibiting the expression of a target gene 
which includes contacting a promoter of the target gene with 
the chimeric protein, so as to specifically methylate the 
promoter sequence of the target gene thus inhibiting 

15 expression of the target gene. 
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Brief Description of the Figures 

Figure 1 . Distribution of CpG sites and transcription 
factor binding sites in the 5' LTR of HIV-1 (Sequence I.D. 

5 No. 1) . CpG sites that fall within the recognition sequence 
of a restriction methyltransf erase or methylation-sensitive 
endonuclease are shown above downward-pointing arrows. Data 
are from Garcia and Gaynor, 1994. 

10 Figure 2 . Sequence of HIV-1 5' LTR. CpG dinucleot ides are 
bold and underlined; CpG sites that lie within the 
recognition sequence of methylation-sensitive restriction 
endonuclease are bold and italicized. Major transcription 
start site is denoted by +1, and the underlined sequence 

15 following denotes the transcribed sequence; numbering is 
with respect to the transcription start site. Please refer 
to Figure 1 for location of protein binding sites. Strike 
through indicates the 9 bp binding site for the 3 finger 
protein developed by Wu et al . (1995). Sequence and 

20 numbering from Garcia and Gaynor, .1994. 

Figure 3 . Gene silencing via targeted DNA methylation. A 

chimera between a sequence-specific DNA binding protein and 
a DNA methyltransferase with attenuated DNA binding directs 
25 methylation only to sites in the vicinity of the recognition 
sequence of the DNA binding protein. 

Figure 4 . Domain organization of mammalian DNA methyl- 
transferase. This enzyme is comprised of a long N- terminal 
30 regulatory domain and a C-terminal catalytic domain that 
closely resembles bacterial restriction methyltransferase. 
See Bestor and Verdine (1994) for a recent review. 



Figure 5 . Protein-DNA contacts in the M . Haelll-DNA 
35 transition state intermediate. The target cytosine is 
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everted from the helix during methyl transfer, and is shown 
in the extra helical position. Thin dotted lines indicate 
base-specific contacts in the major groove, bold lines 
indicate contacts with the sugar-phosphate backbone. 
5 Mutation of residues involved in the sugar-phosphate 
contacts will reduce the affinity of DNA methyl -transferases 
for DNA, as will framework mutations that alter the 
disposition of the contact residues. Such mutations will 
make methyl transfer dependent on the DNA binding protein 
10 moiety of a DNA binding protein/DNA methyltransf erase 
chimera, as described herein. Data from Reinisch et al. 
(1995) . 

Figure 6 . pLS, an expression construct to be used in the 
15 selection of DNA binding protein/DNA me thy ltransf erase 
chimeras that methylate predetermined sequences. Lex A was 

chosen for its ability to direct fused proteins to Lex A 
binding sites on DNA (Brent and Ptashne, 1986) , and M.SssI 
was chosen because of its high specific activity and the 
20 fact that it is the only bacterial methyltransf erase that 
has the same specificity (5'-CpG-3') (Renbaum and Razin, 
1992) as the mammalian enzyme (Bestor and Ingram, 1983). 

Figure 7 » Cyclic in vitro/in vivo selection for DNA binding 
25 protein/DNA methyltransf erase chimeras in which methylation 
is dependent on the DNA binding protein moiety. McrBC en- 
codes a nuclease that degrades heavily methylated DNA. 
Unmethylated DNA that has been linearized by a methylation- 
sensitive endonuclease does not transform. Use of these 
30 selective procedures on combinatorial libraries allows 
selection of chimeric DNA binding protein-DNA 
methyltransf erase that target methylation only to a unique 
target site. This selection scheme may be applied to 
LexA/DNA methyltransf erase chimeras (Example 1) and to zinc 
35 finger/DNA methyltransf erase chimeras (Examples 3-5) . 
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Figure 8 . Identification of CpG sites in the HIV-1 5' LTR 
that yield maximal suppression of transcription when methy- 
lated. Individual 20mer primers that contain single m 5 CpG 
5 sites are hybridized to single -stranded M13 clones that 
contain the HIV-1 LTR and the bacterial CAT reporter gene. 
Extension of the primers with a DNA polymerase (sequenase) 
yields double -stranded DNA that is methylated on one strand 
at one CpG site. Upon transfection the hemimethylated site 
10 rapidly becomes methylated (Busslinger et al., 1983). The 
positions of each of the 11 CpG sites in the HIV-1 5' LTR 
are given with respect to the transcription start site in 
the HIV-1 genome (Garcia and Gaynor, 19 93) . 

15 Figure 9 . Phage-display selection of a zinc finger 

proteins that bind to predetermined sequences in the HIV-1 
5' LTR. The strategy used to construct tridactyl zinc 
finger proteins that bind to specific sequences was 
independently developed by the laboratories of Berg 

20 (Desjarlais and Berg, 1993) , Klug (Choo and Klug, 1994) , 
Pabo (Rebar and Pabo, 1994) # and Barbas (Wu et al . , 1994). 
The schemes are very similar; the one depicted here is that 
of Choo and Klug (1994) . The method has been used to obtain 
proteins that bind to a number of predetermined sequences 

25 with high specificity and affinity. The boxed sequence 
shown in the figure appears 20 base pairs 5' of the first 
Hpall (CCGG) site in the LTR sequence of Figure 2; it is 
shown for illustrative purposes only (Sequence I.D. No. 2). 
The actual target sequence will be determined as described 

3 0 in Example 2. 

Figure 10. Organization and transcription of the HBV genome. 

The viral genome of 3.2 kb is represented by a horizontal 
line. Positions of methylatable CpG sites are shown by 
35 vertical lines. Arrows indicate major viral transcripts. 
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Notice that transcription of each major mRNA initiates in a 
cluster of CpG sites. CpG methylation has been shown to 
suppress viral transcription in transgenic animals (Miller 
and Robinson, 1993; Pourcel et al . , 1990). 

5 

Figures 11A - 11B. Targeted methylation of a preselected CpG 
site . 

(A) A diagram of pLS which encodes a lexA-M.SssI fusion 
protein. A Smal site is located adjacent ot a lexA binding 

10 site in the same plasmid. LexA is predicted to position 
M.SssI over this site, and preferential methylation of 
theSmal site was expected (M.SssI has no intrinsic 
preference for Smal sites) . The positions of other 
restriction sites are shown approximately to scale; other 

15 features of the plasmid are omitted for the sake of clarity. 

(B) Data confirms targeted methylation. Odd-numbered lanes 
contain plasmid that encodes enzymatically active fusion 
protein. The unmethylated state cuses the DNA to be cleaved 
to the linear form. Notice that the Smal site is largely 

20 resistant to cleavage in lane 8 (compare to lanes 7 and 10) , 
while the other sites remain sensitive. The methylation- 
insensitive endonulease Ncol was used as a control. Heavy 
methylation is observed only at the Smal site; little or no 
detectable methylation is seen at Sail and Xhol sites (lanes 

25 4 and 6) . Lin: linear unit-length DNA (unmethylated) . CCC, 
covalently closed circular DNA (methylated in lane 88) . 
Lane 11 contains length markers. The diffuse band that 
migrates above the largest length marker is bacterial 
chromosomal DNA. 

Figure 12. Construction of a plasmid that performs targeted 
methylation. pLM9 was constructed by cloning an Lac 
repressor-M.SssI fusion gene between the Pvull sites of 
pBluescript. Adjacent to the fusion gene is a synthetic 
35 methylation target sequence, LacO-CpGs, which contains the 
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Lac operator (to which Lac repressor binds) and a number of 
CpG target sites. These are 12 amino acids in the linker 
between LacI and M.SssI. Expression of the fusion is 
controlled by the native promoter of lad gene. 

5 

Figure 13. Expression of LacI-M.SssI fusion in E. coli XL1- 
Blue [pLM9] . The targeted methyltransf erase is marked with 
an arrow. The antibody used was anti-Lad. An intact 
fusion protein was produced in good yield. Lane marked pBS 
10 contained proteins extracted from cells that contained only 
the expression vector. 

Figure 14. Targeted methylation demonstrated by bisulfite 
sequencing analysis. The methylation target shown in Figure 

15 12 was isolated from cells expressing LacI -M.SssI fusion 
proteins. The bisulfite sequencing method was used to 
identify methylated sites. pLM9-l and -2 are two sister 
bisulfite cones carrying BS-modified methylation target 
sequence derived from a pLM9 mutant that has attenuated 

20 activity. Methylation is limited to CpG sites in the 
immediate vicinity of the binding site of the sequence 
specific DNA binding protein Lad . pBS is the precursor of 
pLM9 and has no lacI/M.SssI gene; no methylated sites are 
present. pM is a pBS derivative that encodes fully active 

25 (non-targeted) M.SssI gene; all sites are methylated. These 
data confirm that targeted methylation has been used to 
direct methylation to CpG sites in the vicinity of the 
binding site of a sequence -specif ic DNA binding protein. 

30 Figure 15. Zinc-finger (Zif) targeted methyltransf erases . 

The above Zif fusion constructs have been made and 
expression of the appropriate fusion proteins has been 
confirmed by immunoblot and methylation analysis. The 
coding regions have also been transferred into the mammalian 
35 expression vector pcDNA3 . 1/His/A. 
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Figure 16, HIV-1 LTR constructs used to analyze targeted 
methylation. Plasmid pLTR-CAT (top) contains the U3 region 
and 80% of the R region of the HIV-1 proviral long terminal 
repeat (LTR) driving expression of the chloramphenicol 
5 acetyltransf erase (CAT) reporter gene. These elements 
contain the HIV-1 core promoter, enhancer, negative 
regulatory element, and trans-activated region (TAR) . These 
elements also contains 10 CpG dinucleotides (shown as 
circles above the LTR) f which are targets for methylation. 

10 Plasmid pLG-luc (bottom) contains the entire proviral LTR 
and gag leader sequence (GLS) driving expression of the 
firefly lucif erase reporter gene. The 150 bp GLS, which 
lies between the LTR and the first viral gene, contains an 
additional 14 CpG dinucleotides and has been shown to be 

15 important for viral expression. The circles are shaded (see 
box) to indicate the conservation of the individual CpG 
sites in different HIV-1 sequences of class B, the class 
affecting the vast majority of infected individuals in the 
United States and Western Europe. Transcription factor 

20 binding sites are labeled below the LTR and hatched when 
they contain a CpG site. The LTR is flanked by a short 
' region of human genomic DNA * present -in the original HIV 
clone. 

25 Figure 17. Complete suppression of HIV-1 transcription by 
LTR methylation. Methods: pLTR - CAT was treated with M.SssI 
to methylate all CpG sites. Methylated or unmethylated 
pLTR-CAT was cotransf ected with unmethylated pLG-luc into 
HLtat cells and lysates were prepared at the times 

30 indicated. To control for the efficiency of transfection 
and recovery, volumes of lysate containing equivalent 
luciferase activity were assayed for CAT activity using the 
FASTCAT assay kit. Reaction products were quantitated using 
NIH Image. "CAT Activity" indicates conversion of 1- 

35 deoxychloramphenicol to 3 -acetyl -1-deoxychloramphenicol , 
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normalized to arbitrary units. Control cells were 
transfected with pLG-luc only and harvested 48 hours post- 
transf ection . 
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Detailed Description of the Invention 

The present invention provides for a chimeric protein which 
comprises a mutated DNA methyl transf erase portion and a DNA 
5 binding protein portion that binds sufficiently close to a 
promoter sequence of a target gene, which promoter sequence 
contains a methylation site, to specifically methylate the 
site and inhibit activity of the promoter and thus inhibit 
expression of the target gene. 

10 

As used herein, "chimeric protein" is a protein including at 
least two portions of two proteins positioned adjacent to 
each other so that the coding sequences of both portions are 
in frame and can be translated into one polypeptide. The 
15 chimeric protein may include a portion of a DNA binding 
protein and a portion of a mutated or wild type DNA 
methyltransf erase protein. The two portions of the chimeric 
protein may be linked with intermediate sequences of 8,12,16 
or 20 codons expressed as part of the chimeric protein. 

20 

As used herein, "adjacent" includes two sequences that are 
positioned directly next to one another so that the codons 
of each sequence are in frame and contiguous. "Adjacent" 
also includes two sequences that are nearby one another 
25 linked to intermediate sequences and are in frame and can be 
translated into one polypeptide. 

As used herein "linked to" encompasses a means of joining 
two separate portions of a protein, each of which may have 

30 a separate function. The means of joining the functional 
portions of the protein may include a covalent or 
noncovalent association between functional polypeptides via 
an organic substance, an inorganic substance, a nucleotide, 
a polynucleotide, a peptide nucleic acid, a peptide, a 

35 triplex peptide, an electrostatic means, a triplex nucleic 
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acid, or a self -assembling biomaterial . 

As used herein, "DNA methyltransferase" is a protein which 
is capable of methylating a particular DNA sequence, which 
5 particular DNA sequence may be -CpG- . This protein may be 
a mutated DNA methyltransf erase, a wild type DNA 
methyltransf erase, a naturally occurring DNA 
methyltransf erase, a variant of a naturally occurring DNA 
methyltransf erase, a truncated DNA methyltransf erase , or a 

10 segment of a DNA methyltransf erase which is capable of 
methylating DNA. The DNA methyltransf erase may include 
mammalian DNA methyltransf erase , bacterial DNA 
methyltransf erase, M.SssI DNA methyltransf erase and other 
proteins or polypeptides that have the capability of 

15 methylating DNA. 

As used herein, "mutated DNA methyltransf erase protein 
portion" is a segment of a DNA methyltransf erase protein or 
polypeptide containing an amino acid sequence which is 

20 different from the wild type DNA methyltransf erase amino 
acid sequence. A mutated DNA methyltransf erase protein 
portion may . include a segment, of a. naturally occurring 
variant of the wild type DNA methyltransf erase protein, a 
segment of a DNA methyltransf erase with a sequence altered 

25 from the wild type sequence, a truncated protein or portion 
of protein or polypeptide that is capable of methylating 
DNA. This mutated DNA methyltransf erase may be altered at 
one or more of its amino acids and retains part of the wild 
type methyltransferase activity, although the activity of 

30 the mutated DNA methyltransferase protein portion may be 
attenuated activity. 

As used herein, "DNA binding protein portion" is a segment 
of a DNA binding protein or polypeptide capable of 
35 specifically binding to a particular DNA sequence. The 
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binding is specific to a particular DNA sequence site. The 
DNA binding protein portion may include a truncated segment 
of a DNA binding protein or a fragment of a DNA binding 
protein . 

5 

As used herein, "binds sufficiently close" means the 
contacting of a DNA molecule by a protein at a position on 
the DNA molecule near enough to a predetermined methylation 
site on the DNA molecule to allow proper functioning of the 
10 protein and allow specific methylation of the predetermined 
methylation site. 

As used herein, "a promoter sequence of a target gene" is at 
least a portion of a non-coding DNA sequence which directs 
15 the expression of the target gene. The portion of the non- 
coding DNA sequence may be in the 5' -prime direction or in 
the 3 '-prime direction from the coding region of the target 
gene. The portion of the non-coding DNA sequence may be 
located in an intron of the target gene . 

20 

The promoter sequence of the target gene may be a 5' long 
^terminal repeat sequence of a human immunodeficiency virus -1 
proviral DNA. The target gene may be a retroviral gene, an 
adenoviral gene, a foamy viral gene, a parvo viral gene, a 
25 foreign gene expressed in a cell, an overexpressed gene, or 
a misexpressed gene. 

As used herein "specifically methylate" means to bond a 
methyl group to a methylation site in a DNA sequence, which 
30 methylation site may be -CpG-, wherein the methylation is 
restricted to particular methylation site(s) and the 
methylation is not random. 

In one embodiment of this invention, the chimeric protein 
3 5 may be a zinc three -finger DNA binding polypeptide linked to 
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a CpG-specific DNA methyltransf erase protein portion. The 
chimeric protein may comprise a mutated Lex A DNA binding 
protein or polypeptide linked , to a cytosine 
methyl transferase protein portion or polypeptide. 

5 

The chimeric protein may include at least a portion of a 
mutated M.SssI DNA methyltransf erase protein or at least a 
portion of a mutated mammalian DNA methyltransf erase . 

10 In one embodiment of this invention, an expression vector 
may code for the chimeric protein described hereinabove . 
The expression vector may be a replicable vector and may 
include a pLS vector, a prokaryotic expression vector, a 
yeast expression vector, a baculovirus expression vector, a 

15 mammalian expression vector, or an episomal mammalian 
expression vector. 

As used herein, "expression vector" is a nucleic acid 
molecule which is capable of having a coding sequence of DNA 
20 or RNA ligated into it and then expressing that sequence 
when introduced into an appropriate host cell . An 
-expression vector, may . be a mammalian expression vector, or a 
prokaryotic expression vector. 

25 Another embodiment of this invention is a method for 
inhibiting the expression of a target gene which includes 
contacting a promoter sequence of the target gene with the 
chimeric protein described hereinabove, so as to 
specifically methylate the promoter sequence of the target 

3 0 gene thus inhibiting expression of the target gene. In this 
embodiment, the target gene may be an endogenous target gene 
which is native to a cell or a foreign target gene. The 
foreign gene may be a retroviral target gene or a viral 
target gene. 
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The target gene in this embodiment may be associated with a 
cancer, a central nervous system disorder, a blood disorder, 
a metabolic disorder, a cardiovascular disorder, an 
autoimmune disorder, or an inflammatory disorder. The 
5 cancer may be acute lymphocytic leukemia, acute myelogenous 
leukemia, B-cell lymphoma, lung cancer, breast cancer, 
ovarian cancer, prostate cancer, lymphoma, Hodgkin's 
disease, malignant melanoma, neuroblastoma, renal cell 
carcinoma or squamous cell carcinoma. The central nervous 

10 system disorder may be Alzheimer's disease, Down's syndrome, 
Parkinson's disease, Huntington's disease, schizophrenia, or 
multiple sclerosis. The infectious disease may be 
cytomegalovirus, herpes simplex virus, human 

immunodeficiency virus, AIDS, papillomavirus, influenza, 

15 Candida albicans, mycobacteria, septic shock, or associated 
with a gram negative bacteria. The blood disorder may be 
anemia, hemoglobinopathies, sickle cell anemia, or 
hemophilia. The cardiovascular disorder may be familial 
hypercholesterolemia, atherosclerosis, or renin/angiotensin 

2 0 control disorder. 

The metabolic' disorder may be ADA, deficient SCID, diabetes, 
cystic fibrosis, Gaucher' s disease, galactosemia, growth 
hormone deficiency, inherited emphysema, Lesch-Nyhan 
25 disease, liver failure, muscular dystrophy, phenylketonuria, 
or Tay-Sachs disease. The autoimmune disorder may be 
arthritis, psoriasis, HIV, or atopic dermatitis. The 
inflammatory disorder may be acute pancreatitis, irritable 
bowel syndrome, Chrone's disease or an allergic disorder. 

30 

In one embodiment of the subject invention promoter 
sequences of the target genes listed below may be 
therapeutic target genes of the subject invention: dominant 
oncogenes, c-MYB, c-MYC, c-SIS, ERB A, ERB B, ERB B-l, ERB 
35 B-2, HER-2/NEU, c-SRC, c-YES, c-FPS, c-FES, c-FOS, c-JUN, c- 
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ROS, c-ABL, c-FGR, c-MOS, c-RAS, c-RAF, c - MET , c-ETS, BCL-1, 
BCL-2, ETS, c-FMS, c-FES, c-BLK, TCL-1, TCL-2, TCL-3, TCL-5, 
ALL- l/HRX/MLL , PML promoter in PML/RAR-cr fusion, RAR-a 
promoter in RAR-a/PML fusion, and NF-1. 

5 

Genes that are overexpressed in cancer cells are also target 
genes of the subject invention. Inhibiting the expression 
of these target genes may reduce tumorigenesis and/or 
metastasis and invasion . Cancer related genes include : 
10 collagenase 92 Kd Type 4, collagenase 72 Kd Type 4, 
osteopontin, calcyclin, fibroblast growth factor, epidermal 
growth factor, matrilysin and stromolysin. 

Viruses that establish chronic infections and which are 
15 involved in cancer or chronic diseases are also target genes 
of the subject invention. Virus that have possible target 
genes include hepatitis C, hepatitis B, varicella, herpes 
simplex types I and II, Epstein-Barr virus, cytomegalovirus, 
JC virus and BK virus. 

20 

In a preferred embodiment of the subject invention, the 

promoter sequence of. .. viral protein- X and... the promoter.. - 

sequence of pre-S2/5 of the hepatitis B virus may be target 
genes . 

25 

In another embodiment of the subject invention, genes whose 
expression is involved in neurodegenerative disease may be 
target genes of the subject invention. These target genes 
may include beta amyloid precursor protein and the prion 

3 0 protein both of which are shown to be involved in 
Alzheimer's disease. In another embodiment of the subject 
invention, genes which are involved in the etiology of 
acromegaly may be target genes of the subject invention. 
These target genes may include somatostatin, growth -hormone 

35 releasing hormone and constitutive G-stimulatory protein 
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mutants . 

The target gene may be in a cell. The cell may be a 
bacterial cell, an animal cell or a plant cell. The cell 
5 may be a eukaryotic cell, a prokaryotic cell, a virus 
packaging cell, a somatic cell, a germ cell, a neuronal 
cell, a myocyte, a T lymphocyte, a CD4 + cell, a tumor cell, 
a CD4+ cell, or a stem cell. 

10 The chimeric protein may be a mutated Lex A DNA binding 
protein portion or polypeptide linked to a cytosine DNA 
methyltransf erase protein portion or polypeptide. 

The DNA methyltransf erase protein portion may be M.SssI DNA 
15 methyltransf erase protein or the mammalian DNA 
methyltransf erase protein. The chimeric protein may include 
a tridactyl zinc finger protein or polypeptide capable of 
specifically binding the HIV-1 5'LTR nucleic acid sequence 
linked to a cytosine methyltransf erase polypeptide. The 
20 cytosine DNA methyltransf erase polypeptide may be a M. SssI 
DNA methyltransf erase protein or polypeptide or at least a 
portion of the mammalian DNA methyltransf erase' protein 

The contacting may be by means of liposome mediated 
25 delivery, retroviral delivery, gene bombardment, 
electroporation, electronic pulse delivery, air-gun 
injection of a nucleic acid-coated pellet, a self -assembling 
nanocrystalline composition, a cytofectin analog, 
condensation of the nucleic acid, receptor mediated gene 
3 0 transfer, glycosylated macromolecular carriers, polar 
(glyco) lipids, {glyco) peptides , synthetic polymers, a 
triplex nucleic acid, naked nucleic acid transfer, particle- 
mediated nucleic acid transfer or cationic precipitation. 

35 A further embodiment of this invention is a method for 
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inhibiting expression of a target gene in a multicellular 
organism which includes contacting a promoter sequence of 
the target gene with the chimeric protein described 
hereinabove or a nucleic acid molecule encoding the chimeric 
5 protein, so as to specifically methylate the promoter 
sequence of the target gene and thus inhibit expression of 
the target gene in the multicellular organism. 

The multicellular organism may be a plant, an animal or a 
10 human. The plant may be an alfalfa plant, a broccoli plant, 
a rapeseed plant, a carrot plant, a chicory plant, a coffee 
plant, a cucurbita plant, a euromelon plant, a potato plant, 
a raspberry plant, a sunflower plant, a tomato plant, or a 
wheat plant. 

15 

The administration of the chimeric protein to the 
multicellular organism may comprise intralesional , 
intraperitoneal, intramuscular or intravenous injection; 
liposome -mediated delivery; viral infection; gene 

20 bombardment; topical, nasal, oral, anal, ocular or otic 
delivery. The viral infection may be via a non-integrating, 
replication-defective virus. The virus may comprise a 
replication-defective HIV-1 provirus, a retroviral vector, 
an adeno-associated virus, a N2 retroviral vector, a SIM 

25 retroviral vector, a LNL6 vector, a LXSN vector or a MMuLV 
retroviral vector. 

The process by which a plant or animal is rendered resistant 
to viral infection comprises introducing into the plant or 

30 animal a construct which on translation gives rise to the 
above-mentioned chimeric protein. The introduction of the 
nucleic acid molecule is accomplished by genetic 
transformation of a part of the plant by a DNA sequence 
coding for the nucleic acid molecule, followed by the 

35 regeneration of a transgenic plant. The transformation is 
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carried out by the intermediary of Agrobacterium tumefaciens 
or Aarobacterium rhizoqenes . (U.S. Patent Nos. 5,107,065 and 
5,188, 958) . 

5 The present invention is further directed to a DNA construct 
for a plant, the construct comprising a genetic sequence and 
a promoter capable of directing expression of the genetic 
sequences wherein the genetic sequence on expression 
provides at least a portion of a sequence specific DNA 

10 binding protein linked to at least a portion of a mutated 
DNA methyl transferase protein. The DNA construct may 
further be part of a DNA transfer vector suitable for 
transferring the DNA construct into a plant cell and 
insertion into a plant genome. In an embodiment of the 

15 present invention, the DNA construct is carried by broad 
host range plasmid pGA470 which is capable of transformation 
into plant cells using Aarobacterium . The present 
invention, however, extends to other means of transfer such 
as genetic bullets (e.g. DNA-coated tungsten particles, 

20 high-velocity micro projectile bombardment) and 
electroporation amongst others (Maliga, 1993; Bryant, 1992; 
or Shimambto, 1989) . 

The gene encoding the chimeric protein, having been 
25 introduced into the nonhuman mammal, or an ancestor of the 
nonhuman mammal at the single cell stage or an embryonic 
stage, is linked to a promoter and integrated into the 
genome of the nonhuman mammal. One skilled in the art would 
be familiar with the experimental methods necessary to 
30 produce a transgenic mammal, as described in Leder et al . , 
U. S. Patent No. 4,736,866 and Krimpenfort and Berns, U. S. 
Patent No. 5,175,384 and Wagner and Chen, U. S. Patent No. 
5, 175, 385. 

3 5 A further embodiment of the subject invention is a method of 



WO 97/11972 



PCT/US96/15576 



-21- 

treating a subject infected with a virus which comprises 
administering to the subject a therapeutic composition 
comprising a chimeric protein, or a nucleic acid molecule 
encoding the chimeric protein, capable of sequence specific 
5 methylation of a site(s) in a promoter sequence of a target 
viral gene under suitable conditions so as to specifically 
methylate the promoter sequence of the target gene thereby 
inhibiting expression of the target gene and thus treating 
the subject infected with the virus. Examples of viruses 
10 are listed herein. 

The therapeutic composition may comprise a nucleic acid 
encoding at least a portion of a mutated Lex A DNA binding 
protein or polypeptide linked to at least a portion of a 

15 mutated DNA methyltransf erase protein or polypeptide. The 
therapeutic composition may be a nucleic acid encoding a 
tridactyl zinc finger DNA binding protein or polypeptide 
capable of specifically binding the HIV-1 5' LTR nucleic 
acid sequence linked to at least a portion of a mutated DNA 

20 methyltransf erase protein or polypeptide. The therapeutic 
composition may comprise a nucleic acid encoding at least a 
portion of a - sequence specific DNA binding protein or 
polypeptide linked to a mutated DNA methyltransf erase 
protein or polypeptide. 

25 

The subject receiving the therapeutic composition of this 
invention may be a human or a non-human mammal. 

The therapeutic composition may comprise a replicable 
3 0 expression vector chosen from the group consisting of a pLS 
vector, a prokaryotic expression vector, a yeast expression 
vector, a baculovirus expression vector, a mammalian 
expression vector, and an episomal mammalian expression 
vector. The administration may comprise intralesional , 
35 intraperitoneal, intramuscular or intravenous injection; 
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liposome -mediated delivery; viral infection; gene 
bombardment ; topical, nasal, oral, anal, ocular or otic 
delivery. The viral infection may be via a non- integrating, 
replication-defective virus. 

5 

In one preferred embodiment of the method above the nucleic 
acid molecule encoding a chimeric protein is incorporated 
into a liposome to allow for administration to the subject. 
Methods of incorporation of nucleic acid molecules into 

10 liposomes are well known to those of ordinary skill in the 
art. In another embodiment of this method, the nucleic acid 
encoding the chimeric protein may be delivered via 
transf ection, injection, or viral infection. There are 
several protocols for human gene therapy which have been 

15 approved for use by the Recombinant DNA Advisory Committee 
(RAC) which conform to a general protocol of target cell 
infection and administration of transfected cells (see for 
example, Blaese, R.M., et al., 1990; Anderson, W. F., 1992; 
Culver, K.W. et al., 1991). In addition, U.S. Patent No. 

20 5,399,346 {Anderson, W. F. et al . , issued March 21, 1995) 
describes procedures for retroviral gene transfer. The 
contents of these support references are incorporated in 
their entirety into the subject application. Retroviral- 
mediated gene transfer requires target cells which are 

25 undergoing cell division in order to achieve stable 
integration hence, cells are collected from a subject often 
by removing blood or bone marrow. 

Several methods have been developed over the last decade for 
3 0 the transduction of genes into mammalian cells for potential 
use in gene therapy. In addition to direct use of plasmid 
DNA to transfer genes, episomal vectors, retroviruses, 
adenoviruses, parvoviruses, and herpesviruses have been used 
(Anderson et al . , 1995; Mulligan, 1993; The contents of 
35 which are incorporated in their entirety into the subject 
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application) . For transfer of genes into cells ex vivo and 
subsequent reintroduction into a host, as would be most 
feasible in immunodeficiency patients, retroviruses have 
been the vectors of choice. 

5 

A separately preferred embodiment of the invention is a 
replicable expression vector encoding a chimeric protein 
capable of sequence specific DNA methylation. 

10 The replicable expression vector may be chosen from the 
group consisting of a pLS vector, a prokaryotic expression 
vector, a yeast expression vector, a baculovirus expression 
vector, a mammalian expression vector, and an episomal 
mammalian expression vector. The agent may comprise a 

15 mutated Lex A binding polypeptide linked to a 
methyltransf erase polypeptide. The agent may comprise a 
tridactyl zinc finger polypeptide that specifically binds to 
the HIV-l 5'LTR nucleic acid sequence linked to a 
methyltransf erase polypeptide . 

20 

Another embodiment of the invention is a pharmaceutical 
composition comprising a therapeutically effective amount of 
the replicable vector described hereinabove and a 
pharmaceutically acceptable carrier. The carrier may 
25 comprise a diluent. The pharmaceutically acceptable carrier 
may be an aerosol, intravenous, oral or topical carrier and 
is further described herein. 

Another embodiment of the subject invention is a method of 
3 0 obtaining an expression vector including. DNA encoding a 
desired chimeric protein that includes at least a portion of 
a mutated DNA methyltransf erase protein and at least a 
portion of a DNA binding protein such chimeric protein being 
capable of inhibiting the expression of a target gene which 
35 comprises: (a) obtaining a population of expression vectors 



WO 97/11972 



PCT/US96/15576 



-24- 

each of which expresses a chimeric protein including (i) a 
DNA corresponding to an endogenous promoter sequence of the 
target gene, which promoter contains at least one sequence 
specific methylation site; (2) a DNA encoding a mutated DNA 
5 methyltransf erase ; and (3) a DNA encoding a DNA binding 
protein, position adjacent the DNA of (2) , DNA (2) and (1) 
being so positioned in the expression vectors as to permit 
expression of the chimeric proteins; (b) introducing the 
population of vectors from step (a) into an appropriate host 

10 under conditions such that the chimeric proteins are 
expressed and methylate DNA; ( c) isolating the population 
of vectors from step (b) from the host; (d) treating the 
population of vectors from step { c) with a suitable 
restriction endonuclease so as to digest the DNA at a 

15 specific site in the promoter if the site is not methylated; 
(e) introducing the population from step (d) into an 
appropriate host, which hosts have the property that they 
degrade DNA which has been non-specif ically methylated at 
sites other than the sequence specific methylation site 

20 contained in the promoter; and (f) culturing the hosts from 
step (e) under conditions such that vectors which have not 
been degraded express either a mutated or a non-mutated form 
of the chimeric protein, thus obtaining at least one nucleic 
acid encoding a mutated form of a chimeric DNA 

25 methyltransf erase/DNA binding protein capable of inhibiting 
the expression of the target gene. 

As used herein, "desired" includes optimal properties of a 
chimeric protein as described hereinabove. The properties 
30 include specifically methylating a DNA sequence in a 
promoter sequence of a target gene and inhibiting the 
activity of the promoter and thus inhibiting expression of 
the target gene. 

35 In the practice of any of the methods of the invention or 
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preparation of any of the pharmaceutical compositions an 
"effective amount" is an amount which is effective to 
produce the sequence specific methyltransf erase and 
inactivation of the target gene. Accordingly, the effective 
5 amount will vary with the subject being treated, as well as 
the condition to be treated. For the purposes of this 
invention, the methods of administration are to include, but 
are not limited to, administration cutaneously, 
subcutaneously, intravenously, parenterally, orally, 
10 topically, or by aerosol. 

As used herein, the term "suitable pharmaceutical^ 
acceptable carrier" encompasses any of the standard 
pharmaceutical^ accepted carriers, such as phosphate 

15 buffered saline solution, water, emulsions such as an 
oil/water emulsion or a triglyceride emulsion, various types 
of wetting agents, tablets, coated tablets and capsules. An 
example of an acceptable triglyceride emulsion useful in 
intravenous and intraperitoneal administration of the 

2 0 compounds is the triglyceride emulsion commercially known as 
Intralipid® . 

Typically such carriers contain excipients such as starch, 
milk, sugar, certain types of clay, gelatin, stearic acid, 

2 5 talc, vegetable fats or oils, gums, glycols, or other known 

excipients . Such carriers may also include flavor and color 
additives or other ingredients. 

Also comprehended by the invention are pharmaceutical 

3 0 compositions comprising therapeutically effective amounts of 

polypeptide products of the invention together with suitable 
diluents, preservatives, solubilizers , emulsifiers, 
adjuvants and/or carriers. A "therapeutically effective 
amount" as used herein refers to that amount which provides 
35 a therapeutic effect for a given condition and 
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administration regimen. Such compositions are liquids or 
lyophilized or otherwise dried formulations and include 
diluents of various buffer content (e.g., Tris-HCl., 
acetate, phosphate) , pH and ionic strength, additives such 
5 as albumin or gelatin to prevent absorption to surfaces, 
detergents (e.g., Tween 20, Tween 80, Pluronic F68, bile 
acid salts). solubilizing agents (e.g., glycerol, 
polyethylene glycerol), anti-oxidants (e.g., ascorbic acid, 
sodium metabisulf ite) , preservatives (e.g., Thimerosal, 

10 benzyl alcohol, parabens) , bulking substances or tonicity 
modifiers (e.g., lactose, mannitol) , covalent attachment of 
polymers such as polyethylene glycol to the protein, 
complexation with metal ions, or incorporation of the 
material into or onto particulate preparations of polymeric 

15 compounds such as polylactic acid, polglycolic acid, 
hydrogels, etc, or onto liposomes, microemulsions, micelles, 
unilamellar or multilamellar vesicles, erythrocyte ghosts, 
or spheroplasts . Such compositions will influence the 
physical state, solubility, stability, rate of in vivo 

20 release, and rate of in vivo clearance. The choice of 
compositions will depend on the physical and chemical 
properties of the protein. For example, a product derived 
from a membrane -bound form of a protein may require a 
formulation containing detergent. Controlled or sustained 

25 release compositions include formulation in lipophilic 
depots (e.g., fatty acids, waxes, oils). Also comprehended 
by the invention are particulate compositions coated with 
polymers (e.g., poloxamers or poloxamines) and coupled to 
antibodies directed against tissue-specific receptors, 

30 ligands or antigens or coupled to ligands of tissue-specific 
receptors. Other embodiments of the compositions of the 
invention incorporate particulate forms protective coatings, 
protease inhibitors or permeation enhancers for various 
routes of administration, including parenteral, pulmonary, 

35 nasal and oral. 
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Another embodiment of the subject invention is a transgenic 
non-human mammal whose somatic and germ cells contain and 
express a gene coding for a desired chimeric protein which 
is capable of inhibiting the expression of a target gene, 
5 the gene having been stably introduced into the non-human 
mammal at the single cell stage or an embryonic stage, and 
wherein the gene is linked to a promoter and integrated into 
the genome of the non-human mammal. The desired chimeric 
protein may be the desired chimeric protein described 
10 hereinabove. One skilled in the art would be familiar with 
the experimental methods necessary to produce a transgenic 
mammal, as described in Leder et al . , U. S. Patent No. 
4,736,866 and Krimpenfort and Berns , U. S. Patent No. 
5,175,384 and Wagner and Chen, U. S. Patent No. 5,175,385. 

15 

If it were possible to stimulate de novo methylation of HIV 
proviral DNA, then clonal inheritance of methylation 
patterns would render the inactivation essentially 
irreversible. As described herein, this irreversible 
20 inactivation is the basis of a therapeutic approach that has 
practical advantages over alternative molecular biological 

approaches (including antisense RNA, ribozymes, targeted . . i 

nucleases, and intracellular anti-HIV antibodies; reviewed 
by Volberding, 1995) . Provirus silencing by targeted 

25 cytosine methylation stimulates an existing host defense 
system and as an analogy, has the virtues of a vaccine: the 
clonal inheritance of methylation patterns is like the 
immunological memory, and the strong suppression of 
transcription is like the immune defense system in that both 

30 perturb the function of particular gene products. 

It is important to point out that the therapeutic agent that 
methylates the target sequence need be present only 
transiently; the new methylation pattern would be propagated 
35 indefinitely by the cellular DNA methylating system (Bestor, 
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1990; Bestor, 1992; Bestor and Coxon, 1993; Bestor and 
Verdine, 1994) . The alternate molecular biological 
approaches to medical therapies mentioned above, require 
indefinite expression of the agent. This has proven to be 
5 difficult in practice (Challita and Kohn, 1994) and may be 
toxic or may provoke a harmful immune response. An 
additional complication of long-term expression is 
insert ional mutagenesis, while inactivation by targeted 
methylation as described herein may involve delivery of the 
10 agent by a nonintegrating, replication-defective virus that 
establishes an abortive infection. None of the alternative 
molecular biological therapies discussed above has these 
advantages . 

15 The 5' LTR of HIV-1 may be inactivated by targeted cytosine 
methylation. As shown in Figure 1 below, the LTR that 
directs transcription of the provirus is rich in binding 
sites for transcription stimulatory proteins (Nabel, 1993; 
Garcia and Gaynor, 1994) and rich in CpG sites, the 

20 predominant site of methylation in vertebrates. It is also 
well-established that methylation of only a few of the CpG 
sites in the 5' LTR represses transcription to nearly 
undetectable levels {Bednarik et al . , 1990; Joel et al . , 
1993, reviewed by Bednarik, 1992), but the specific, 

25 critical sites that mediate this effect have not been 
identified . 

Mammalian promoters are silenced by cytosine-5 methylation 
at 5'-CpG-3' dinucleotides. The promoter within the 5' LTR 
3 0 of HIV-1 may be silenced by directing enzymatic methylation 
to critical CpG sites. Because methylation patterns are 
subject to clonal inheritance, the silencing is essentially 
irreversible . 

3 5 The subject invention encompasses but is not limited to the 
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following additional embodiments. 

1. Methylation of a predetermined CpG site. In one 
embodiment of the subject invention a sequence-specific DNA 

5 binding protein may be fused to a DNA cytosine 
methyltransf erase, and mutant fusion proteins whose 
methyltransferase activity is dependent on the DNA binding 
protein may be selected from combinatorial libraries. DNA 
binding protein/DNA methyltransferase chimeras that 

10 methylate only those CpG sites immediately adjacent to the 
binding site may be obtained by means of a novel cyclic in 
vivo/in vitro selection protocol. An attenuated version of 
the CpG- specific DNA methyltransferase M.SssI may be 
constructed in this way. In vitro selection requires 

15 resistance to methylation-sensitive restriction endonuclease 
cleavage at the target site, and in vivo selection removes 
plasmid molecules methylated at non-specific sites; this 
second step involves growth in McrBC^s trains of E . coli t 
which degrade heavily methylated DNA. 

20 

2. Identification of those CpG sites in the HIV-1 5' LTR 
whose, methylation produces maximal repression of transcrip- " 

tion. The documented inhibitory effects of methylation on 
HIV-l transcription are likely to be exerted through a 

25 subset of the 11 CpG sites in the 5' LTR of HIV-1. In a 
further embodiment of the subject invention, these critical 
CpG sites may be identified by transcription assays with 
test constructs that bear singly methylated CpG sites; each 
of the 11 CpG sites can be tested individually. The 

30 information obtained may be important for the selection of 
target sites in Examples 3-5. 

3. Design, selection, and affinity maturation of zinc 
finger-DNA methyltransferase chimeras that methylate 

5 critical CpG sites in the HIV 5' LTR. In another embodiment 



WO 97/11972 



PCT/US96/15576 



-30- 

of the subject invention, zinc finger modules may be 
catenated in novel combinations to yield proteins that bind 
with high affinity and specificity to predetermined 
sequences. Zinc finger proteins that bind to sites adjacent 
5 to critical CpG sites (identified in Example 2) may be 
selected from combinatorial expression libraries and fused 
to an attenuated CpG-specific DNA methyltransf erase 
(developed in Example 1) . Further rounds of cyclic in 
vivo/ in vitro selection (as developed in Example 1) may be 
10 used to obtain variants that methylate only CpG sites 
adjacent to the binding site of the zinc finger moiety. 

4. Inhibition of HIV-1 5' LTR- dependent transcription in 
cultured human cells that express zinc finger-DNA methyl - 

15 transferase fusion proteins of novel and predetermined 
specificity. Example 3 yields constructs that methylate 
critical sites in the HIV-1 5' LTR. In a further embodiment 
of the subject invention, these constructs may be introduced 
into cultured human cells that express a reporter gene 

20 driven by the HIV-1 5' LTR as a means of testing the 
inhibitory effects of targeted de novo methylation. 



5. Inhibition of HIV-1 replication in human T lymphocytes 
productively infected with HIV-1. In another embodiment of 

25 the subject invention, constructs that prove effective in 
the studies of Example 4 may be introduced into HIV-1 
producing human T lymphocytes, and methylation of the target 
sequence and its effect on virus production can be measured 
directly. The methylating construct may be delivered on a 

3 0 plasmid vector or as part of a recombinant adeno- associated 
virus (AAV) . These embodiments suggest a high level of 
usefulness of targeted methylation in the control of HIV-1 
proliferation, and the development of the methylating 
construct into a therapeutic agent. 



35 
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Direction of a DNA methyltransf erase to any CpG site within 
HIV-1 regulatory sequences, may result in the silencing of 
the provirus dependent upon the specific zinc finger 
protein/DNA methyltransf erase chimera. Zinc finger motifs 
5 have been shown to be remarkably versatile DNA binding 
structures (Choo and Klug, 1994) ; they are independent ly- 
folded protein modules that recognize the edges of 3 base 
pairs in the major groove. Over 1,300 zinc fingers have 
be en identified., and the binding specificity of a large 

10 number has been determined (Choo et al . , 1995). Several 
laboratories have succeeded in assembling zinc finger 
modules in novel combinations; these designed proteins have 
been subjected to refinement by phage -display selection > and 
variants that bind to novel predetermined sequences with 

15 high specificity and nanomolar affinity have been obtained 
by several laboratories (Choo et al . , 1994; Wu et al, 1995; 
Desjarlais and Berg, 1994; Rebar and Pabo, 1994) . These 
small proteins contain tandem arrays of three zinc finger 
modules and bind to 9 bp recognition sequences. Klug and 

2 0 colleagues were able to show that a zinc finger that was 

targeted to the BCR-ABL breakpoint sequence could bind to 
and suppress transcription of the BCR-ABL gene in cultured 
human leukemic cells (Choo et al . , 1994). A zinc finger 
protein designed to bind to a sequence adjacent to a CpG 
25 site in the HIV-1 5' LTR can be fused to an attenuated CpG- 
specific DNA methyltransf erase , thereby achieving efficient, 
selective methylation of that CpG site. A DNA 

methyltransferase with the desired binding properties has 
been cloned from a Spiroplasma species; this enzyme, known 

3 0 as M.SssI, has the same sequence specificity as the 

mammalian DNA methyltransferase (CpG) (as cloned by Bestor; 
Genbank Accession No. X14805) , but a higher turnover number 
and no requirement for hemimethylated substrates (Renbaum 
and Razin, 1992). The strategy is shown in Figure 3. 
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One embodiment of this invention is the development and 
therapeutic application of zinc finger protein/DNA 
methyltransf erase chimeras or agents whose general 
organization is similar to that of mammalian DNA 
5 methyltransf erase . These agents selectively methylate and 
inactivate the 5' LTR of HIV-1. Such agents have 
considerable promise as therapeutic agents for retroviral 
diseases, and for other applications where selective 
inactivation of a promoter sequence is desired. 

10 

The zinc finger protein/DNA methyltransf erase chimera of 
this invention is similar in design to mammalian DNA 
methyltransf erase, which has been purified (Bestor and 
Ingram, 1983; Bestor and Ingram, 1985a), characterized in 

15 terms of biochemical activities (Bestor and Ingram, 1985; 
Bestor, 1987) , the cDNA that encodes the enzyme cloned 
(Bestor, 1988; Genbank Accession No. X14805; Bestor et al . , 
1988) , and the protein demonstrated to be essential for 
mouse development in a gene disruption study (Li et al., 

20 1992; Li et al . , 1993). As shown below, mammalian DNA 
methyltransf erase has an N- terminal regulatory domain and a 
C- terminal catalytic domain that closely resembles bacterial 
restriction methyltransf erases . For a recent review of the 
structure and function of mammalian DNA methyltransf erase , 

25 see Bestor and Verdine (1994) . The agents which are 

embodiments of this invention differ from mammalian DNA 
methyltransf erase in that the regulatory domain targets the 
catalytic domain to predetermined sequences. The regulatory 

3 0 region of mammalian DNA methyltransf erase suppresses the 
methylation of previously unmethylated sites (Bestor, 1992) , 
and directs the protein to sites of new DNA synthesis in S 
(synthesis) phase nuclei (Leonhardt et al . , 1992). The 
targeted methylation which is an embodiment of this 

35 invention is grounded in the knowledge of the basic biology 
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of the system that establishes and maintains methylation 
patterns in the mammalian genome. 

The expression construct pCAL7 directs the expression of the 
5 DNA methyltransferase H.SssI (Renbaum and Razin, 1992). 
This enzyme in present in Spiroplasma species and has the 
same sequence specificity as mammalian DNA methyltransferase 
(CpG) , but a higher turnover number and no requirement for 
hemimethylated substrates (Renbaum and Razin, 1992) . The C- 

10 terminal domain of mammalian DNA methyltransferase also has 
CpG-specific DNA methyltransferase activity and may also be 
used in practicing this invention, although k cat is much 
lower than that of M.SssI (Bestor, 1992; Renbaum and Razin, 
1992) . The plasmid construct described in Example 1 may be 

15 confirmed by restriction analysis and is described in full 
in the following section and depicted in Figure S. This 
construct embodies (in a simple form) most of the principles 
of the targeted methylation approach described herein. 
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Table 1 shown below, offers data which validates the 
selection against indiscriminate methylation via McrBC 
restriction of methylated DNA (Table 1) . 

5 



E. coli strain Input plasmid CFU /ig DNA' 1 

(Mcr genotype) (methylation status) 



ER13 81 (McrBC*) 


pSE4 (— ) 


l.lxlO 9 


ER1793 (McrBC*) 


pSE4 (-) 


1.7 x 10* 


ER1381 (McrBC*) 


pCAL7 ( + ) 


6 x 10 5 


ER1793 (McrBC") 


pCAL7 ( + ) 


8 x 10 8 



Table 1. Selection against nonspecific methylation in 
McrBC+ E. coli. pSE4 is an unmethylated control plasmid; 

15 pCAL7 directs synthesis of M.Sssl (Renbaum et al., 1992) 
under the control of the p tac promoter. Plasmids were 
introduced into the indicated bacterial strains (Raleigh, 
1992; Sutherland et al . , 1992) by electroporation, and 
transformation efficiencies determined by colony counts on 

20 selective media. The values shown are conservative, as IPTG 
induction was not used in this case, and the plasmid was 
only partially methylated as a result of basal expression 
from the p tac promoter; measurements of Hpall resistance 
indicated <50% methylation. IPTG treatment increased the 

25 methylation level and increased restriction from a factor of 
~10 5 to 10 4 . 

Promoter inactivation via targeted cytosine methylation. 

30 A lambda repressor- integrase fusion protein has been shown 
to direct HIV-1 integration to sites in the vicinity of 
lambda operators (Bushman, 1994) . Through an analogous 
mechanism, a chimeric DNA binding protein/DNA 
methyltransf erase will cause preferential methylation of 
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sites in the vicinity of the recognition sequence of the DNA 
binding protein through a very large increase in the local 
concentration of DNA methyltransf erase . As shown in Figure 
3, this may be used to selectively methylate and silence 
5 predetermined promoter sequences. 

Crystallography data have shown that DNA cytosine 
methyltransferases make base-specific contacts in the major 
groove, and numerous sequence-independent contacts with the 
10 DNA backbone (Bestor and Verdine, 1994) . A diagram of these 
contacts is shown in Figure 5. 

Mutations of residues involved in the backbone contacts 
lowers the affinity of the enzyme for DNA and reduces the 

15 rate of methyl transfer. However, tethering the weakened 
DNA methyltransf erase to a sequence- specif ic DNA binding 
protein greatly increases the concentration of the DNA 
methyltransf erase in the vicinity of the binding site, thus 
offsetting the effects of reduced affinity. This results in 

20 a chimeric protein that methylates sites only in the immedi- 
ate vicinity of the recognition sequence of the DNA binding 
protein. Toxicity due to methylation of collateral CpG 
sites is obviated. Rather than estimating the nature of 
mutations that would yield the desired result, proteins with 

25 the desired properties may be selected from large 
combinatorial libraries. The design and selection of such 
chimeric proteins, and their development into potential 
therapeutic agents, is an embodiment of the subject 
invention. The 5' LTR of HIV-1 may be one target of 

30 inactivation, and a rational, step- wise approach to the this 
is described below in the Experimental Details. 

Gene Inactivation by Targeted Methylation 

35 1. Additional embodiments of methylating agents. 
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As mentioned above, the zinc finger/DNA methyltransf erase 
chimeras discussed and described herein are analogous in 
structure to mammalian DNA methyltransf erase, the enzyme 
that normally establishes and maintains methylation patterns 
5 in the genome. This enzyme has a zinc binding site that has 
been shown to participate in the discrimination of 
unmethylated and hemimethylated CpG sites (Bestor, 1992) . 
Replacement of this zinc-binding region with a tridactyl 
finger region of 'known binding specificity confers a new de 

10 novo specificity on the enzyme. Such an enzyme may be less 
immunogenic, and the efficiency of methylation may be 
increased because the coupling of DNA replication and 
methylation would be restored (Leonhardt et al . , 1992) . New 
generations of increasingly effective methylating agents may 

15 be developed using this approach. 

2 . Delivery of the methylating agent 

Described herein is a new therapeutic agent. Many delivery 
agents are described and encompass embodiments of this 

20 invention which are technically sufficient to allow delivery 
to the target cell population. The AAV transducing vector 
used in the experiments of Example 5 has many advantages, 
although other transducing vectors with superior properties 
are also encompassed in this invention. Delivery of the 

25 methylating agent to a target cell population has no addi- 
tional difficulties over those associated with other molecu- 
lar biological therapeutics. 

As pointed out earlier, in favorable cases the methylating 
30 agent need be present only transiently, since the new 
methylation pattern is subject to clonal inheritance, and 
gene silencing mediated by this agent is essentially irre- 
versible. This advantage is not enjoyed by alternative 
molecular biological therapeutics. In cases of recurrent 
35 infection or relapse due to silencing of an inadequate 
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fraction of the proviral load, further treatments with the 
original agent or an agent that recognizes a different 
critical CpG site may be undertaken. The highly versatile 
nature of the methylating agents makes such an approach more 
5 favorable than the extant molecular biological therapeutic 
agents . 

3. Prophylaxis in humans and animals 

Targeted methylation may also be applied to prophylaxis and 

10 the prevention of recurrent infections. One promising route 
involves a zinc finger/DNA methyltransf erase chimera that 
resides in the cytoplasm by virtue of the lack of nuclear 
localization signals, and which methylates incoming viral 
DNA during its transit through the cytoplasm. Previous work 

15 has demonstrated that retroviral DNA is accessible to 
nucleases (and therefore to methyltransf erases of similar 
size) while in the cytoplasm (Bowerman et al . , 1989), and 
methylation prior to integration would yield a provirus that 
is transcriptionally silenced from the time of integration. 

20 Long-term, stable expression of such a construct could 
protect recurring infections with HIV-1 or other susceptible 
viruses. Successful development of constructs that can - 
selectively silence specific viral promoters justifies the 
development of prophylactic derivatives for medical and 

25 veterinary applications and are also included as an 
embodiment of the instant invention. 

4 . Applications of targeted methylation to other gene 
inactivation studies 

30 The methylating agents described here are very versatile, 
and can be targeted to other promoters by installation of 
zinc finger moieties of the appropriate sequence 
specificity. The choice of HIV-1 as a target in this 
proposal was motivated by the severe threat to public health 

35 posed by this virus rather than by any special 
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vulnerability . Slight modification of the design and 
selection procedures can produce agents that methylate any- 
predetermined sequence, and such agents may be useful in any 
case in medicine or experimental biology where it is 
5 desirable . to silence a given promoter. 

This invention is illustrated in the Experimental Detail' 
section which follows. These sections are set forth to aid 
in an understanding of the invention but are not intended 
10 to, and should not be construed to, limit in any way the 
invention as set forth in the claims which follow 
thereafter . 



WO 97/11972 



PCT/US96/15576 



-39- 

Experimental Details 

In the construction of a nucleic acid vector (see Figure 6) , 
LexA was chosen for its proven ability to direct fused 
5 proteins to LexA binding sites (Brent and Ptashne, 1986), 
and M.SssI was chosen because of its high specific activity 
and the fact that it is the only bacterial methyltransf erase 
that has the same specificity (5'-CpG-3'; Renbaum and 
Razin, 1992) as the mammalian enzyme (Bestor and Ingram, 
10 1983) . 

Example 1. Targeted de novo methylation. 

The product of the construct represented in Figure 6 yields 
an increased local concentration of DNA methyltransf erase in 

15 the vicinity of LexA binding sites. As described above, 
this alone does not ensure targeted methylation, as the 
M.SssI moiety retains intrinsic activity towards all CpG 
sites and substantial methylation of collateral sites is to 
be expected; such indiscriminate methylation is lethal to 

20 mammalian cells. It is therefore necessary to make the DNA 
methyltransf erase moiety dependent on LexA-mediated DNA 
binding; this may be accomplished by selection of mutant • 4 « 
versions of M.SssI that have reduced intrinsic DNA binding 
activity. A novel cyclic in vivo/in vitro selection 

25 protocol is used to select mutant proteins of the desired 
character. Resistance to cleavage by a methylation - 
sensitive restriction endonuclease provides in vitro 
selection for methylation of the target site; growth of 
cleavage-resistant pools of plasmid in McrBCf strains of E. 

30 coli selects against chimeras that methylate non-target 
sites. (Mcr was named from modified cytosine restriction; 
the system causes the degradation of plasmid DNA that is 
methylated at many positions but has no effect on DNA that 
is methylated at single sites; Raleigh, 1992) . The selection 

35 scheme is depicted in Figure 7. 
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As mentioned above, the initial construct methylates both 
specific (that is, the Smal target site) and non-specific 
CpG sites elsewhere on the plasrnid. It is therefore 
necessary to transfer DNA binding authority from the 
5 catalytic moiety to the DNA binding moiety; this is done by 
selecting for mutations that prevent the methylation of non- 
specific sites while allowing methylation of the specific 
site . 

10 It cannot be predicted as to which mutations might give the 
desired reduction in affinity for DNA, so random mutations 
are introduced and selection is applied to obtain mutants of 
the desired character. The M.SssI moiety is mutagenized by 
Mn + * PCR (PCR amplification in the presence of Mft causes 

15 misincorporation of nucleotides and therefore transition 
and transversion mutations) from primers just external to 
the BamHI and AscI sites (the primers are depicted as arrows 
in Figure 8) . The PCR product is cleaved with BamHI and 
AscI, the fragment purified by gel electrophoresis and 

20 ligated between the AscI and BamHI sites of pLS (Figure 6) . 
The ligation products are electroporated into JM105 
(McrBC) . Calculations indicate that optimal ligation and 
electroporation conditions allow selection to be applied to 
>10 9 separate clones per screen (J. Hill and T. Bestor, data 

25 not shown) . After 20 min growth in rich liquid medium at 
37°C and 150 min in rich medium supplemented with 50 fig ml" 1 
ampicillin, the cells are washed 3 times and plasrnid DNA 
extracted by the alkaline lysis procedure. The DNA is 
treated with 5 units /zg" 1 Smal for 2 hr at 25°C, then 

30 transformed into McrBC+ E. coli strain ER1381. As 
previously described, the McrBC system degrades DNA that is 
methylated at many sites but does not affect DNA methylated 
at single sites (Raleigh, 1992) . It may be necessary to 
perform more than one iteration of the mutagenesis/selection 

35 procedure, but the process is not demanding and each itera- 
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tion of the complete cyclic in vivo/in vitro selection 
method can be completed in a few days. 

An additional selection step may be added to the cyclic 
selection procedure if the previous two steps give insuffi- 
cient enrichment for the clones of interest. Plasmid pools 
are extracted from McrBCZ bacteria and cleaved with Smal 
together with one of the 6 methylation-sensitive enzymes 
that have unique sites in pLS . if both sites are 
methylated, no cleavage occurs. If both sites are 
unmethylated, both will be cleaved. Only if one site is 
methylated and the other unmethylated will unit -length 
molecules with compatible ends be produced. These unit- 
length linear molecules are isolated by agarose gel 
electrophoresis, circularized by treatment with DNA ligase, 
and reintroduced into E. coli by electroporation . Note that 
this selection step, as in the case of the other two, can be 
completed in less than two days. 

The procedure described above is designed to produce a 
chimeric protein that methylates a predetermined target site 
adjacent to the binding site of a sequence-specific DNA 
binding protein. Candidate clones may be confirmed by 
simple assays. To confirm that Smal resistance is due to 
methylation of the target site and not to loss of the site 
by mutation, candidate plasraids are tested for resistance to 
Smal and sensitivity to the methylation- insensitive 
isoschizomer Xmal (both enzymes recognize the sequence 
CCCGGG) . Constructs that pass this test are examined for 
non-specific methylation by testing for sensitivity to the 
methylation-sensitive endonucleases BstUl (CGCG) , Hpall 
(CCGG), Mae 1 1 (ACGT) , Hhal (GCGC) . Resistance and sensitiv- 
ity is assessed from banding patterns after agarose gel 
electrophoresis. It can be concluded that targeted 
methylation was achieved upon recovery of plasmids that are 
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resistant to Srnal and sensitive to Xmal, BstUl, Hhal, Hpall, 
and Mae I I . 



If clones of the desired binding and enzymatic specificity 
5 do not emerge from the screen, a likely reason may be 
stearic incompatibility of the DNA binding protein and DNA 
methyltransf erase in the orientation shown in Figure 6 . New 
constructs may be made in which the order of the moieties 
are reversed, and the selection screen described above may 
10 be done on these new constructs. 



The number of iterations of the mutagenesis/selection 
protocol that is required to produce constructs with the 
required characteristics may be variable. However, the 
15 procedure is rapid, and multiple iterations are not 
demanding . 



Constructs that have the desired binding and enzymatic 
specificity are sequenced in order to identify mutations 

20 that make DNA methyltransf erase activity dependent on the 
binding energy of the DNA binding fusion partner. The 
' nature and position of the mutations are referred to- -the 3D 
structure of cytosine methyltransf erases M.Hhal 
(Klimasauskas et al . , 1994) and M.Haelll (Reinisch et al . , 

25 1995) ; both of these enzymes are closely related to each 
other and to M.SssI, and conservation of tertiary structure 
is very likely. The desired mutations may map to one of the 
many residues that make nonspecific contacts with the DNA 
backbone (reviewed by Bestor and Verdine, 1994) . These 

3 0 mutations are introduced into the starting constructs of 
Example 3 so as to reduce the amount of selection to obtain 
constructs of the desired character. 

Example 2. Identification of those CpG sites in the HIV-1 5' 
35 LTR whose methylation produces maximal repression of 
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transcription. As described previously, viral transcription 
is strongly repressed by methylation of the HIV-l 5' LTR 
(reviewed by Bednarik, 1992) . It is very likely that this 
effect is exerted through a subset of the 11 CpG sites 
5 present in the 5' LTR of HIV-l (see Figure 1) . The critical 
CpG sites are identified by measurements of transcription 
rates of test constructs that bear single methylated CpG 
sites. The CpG site that gives the highest degree of 
inhibition when methylated serves as the target for Examples 
10 3 and 4. 

The approach is related to methods established by 
Busslinger et al . (1983). The Pstl-BamHI insert of pUC- 
BENN-CAT (which contains the HIV-l 5' LTR driving 

15 transcription of the bacterial chloramphenicol 
acetyltransf erase (CAT) gene) (Gendelman et al . , 1986) is 
excised and cloned into M13 mp!8 . Single-stranded phage DNA 
is prepared, and synthetic oligonucleotide primers that bear 
m 5 C at single specific CpG sites are hybridized to an M13 

2 0 clone of the sequence of interest. The primer is extended 
with a DNA polymerase; sequenase is the polymerase of choice 
for. this invention because of its high processivity and low ... - 
yield of partially extended products. The nick at the 5' 
end of the primer is sealed by incubation with E. coli DNA 

25 ligase and the circular double stranded molecules are 
purified away from single stranded template and 
oligonucleotide primer by agarose gel electrophoresis. The 
circular DNA is purified from gel slices, cleared of 
ethidium bromide by extraction with phenol, collected by 

30 ethanol precipitation, and mixed with equimolar amounts of 
an act in promoter- lucif erase expression construct prior to 
transfection into HL2/3 cells (Ciminale et al . , 1990). This 
cell line expresses high levels of the HIV-l proteins Gag, 
Env, Tat, Rev, and Nef proteins and affords a good 

35 simulation of HIV-l transcription in infected T cells; it is 
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especially important to identify CpG sites that suppress 
transcription when methylated and in the presence of the 
virus-encoded stimulatory factor Tat. The hemimethylated 
sites introduced into the constructs are converted to 
5 symmetrically methylated sites immediately upon transfection 
(Busslinger et al., 1983) . The ratio of CAT activity to lu- 
cif erase activity (both are measured by simple and well-es- 
tablished assays) are taken as the measure of inhibition by 
CpG methylation. Control constructs are either completely 

10 unmethylated or completely methylated (by extension of the 
primer in the presence of dm 5 CTP) . The location of the 11 
CpG sites and the method of synthesis of the methylated 
expression constructs is shown in Figure 5. The plasmid 
pUC-BENN-CAT and the cell line HL2/3 are distributed by the 

15 NIH AIDS Research and Reference Reagent Program, and have 
been obtained from that source. This experiment is 
straightforward, and the results are invaluable for the 
experiments described in Examples 3, 4, and 5. The assays 
are done both in transiently transfected cells, and in cell 

20 clones that have stably integrated the construct; these 
clones are established by G418 selection after unlinked co- 
transformation" with pSV2Neo and the pUC-BENN-CAT constructs. 

Example 3. Design, selection, and affinity maturation of 
25 zinc finger-DNA methyl transferase chimeras that methylate 
critical CpG sites in the HIV 5' LTR. There are three 
important factors to be taken into account when selecting a 
site for targeted methylation in the HIV-1 5' LTR. First, 
the site must have the ability to repress transcription when 
30 methylated; this is established in the. experiments of 
Example 2. Second, the binding site for the zinc finger 
moiety must be adjacent to (rather than within) the 
recognition sequence of a regulatory factor. The factors 
that bind to the 5' LTR are all host -encoded and are 
3 5 involved in the transcription of many cellular genes; 
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methylation of such sites is expected to cause unacceptable 
toxicity through inactivation of vital cellular genes. 
Third, the recognition sequence is preferably from 6 to 25 
base pairs in length and more preferably at least 9 base 
5 pairs in length, because such sequences occur by chance only 
every 4 9 (or 262,144) base pairs. This rarity ensures that 
target sites do not occur by chance in significant numbers 
of cellular promoters. 

10 Zinc finger proteins that meet the above criteria are 
selected from combinatorial expression libraries by phage 
display, and fused to the CpG-specific DNA methyltransf erase 
M.SssI as described in Example 1, and depicted in Figure 9. 
Rounds of cyclic in vivo/in vitro selection (also described 

15 in Example 1) are used to obtain variants that methylate 
only CpG sites adjacent to the binding site of the zinc 
finger moiety. As before, more than one iteration of the 
mutagenesis/selection procedure may be required, but the 
process has no difficult or time-consuming steps. Only a 

2 0 few days are required for each iteration, and large numbers 
of clones (>10 9 per cycle) can be subjected to selection at 
once . . 

The method for the selection of tridactyl zinc finger 
25 proteins that bind to predetermined sequences is shown in 
Figure 9 . 

The method depicted in Figure 9 is used to select a 
tridactyl finger protein that binds to a 9 bp sequence adja- 

30 cent to a critical CpG site (identified in Example 2) . The 
spacing between the recognition sequence and the target CpG 
site may be adjusted to conform to the sum of the estimated 
radii of the two globular proteins of masses 30,000 (M.SssI) 
and 20,000 (the tridactyl zinc finger protein). This 

35 spacing is equivalent to roughly 25 base pairs. 
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The selected finger protein is fused to M.SssI 
methyltransf erase via a random linker to create a second 
combinatorial expression library. From this point forward 
the protocol is very similar to that described in Example 1. 
5 The construct differs from that shown in Figure 6 only in 
that LexA is replaced with the coding region for the zinc 
finger protein, and the LexA binding site is replaced with 
the HIV-1 LTR sequence used for selection. The c-MYC 
epitope tag is included to allow visualization of the 

10 protein on immunoblots and by immunofluorescence. The 
sequence and spacing of the central CpG in the Smal site and 
the recognition sequence for the zinc finger protein is the 
same as that in the HIV-1 LTR target sequence; a minimal 
change that converts the sequences flanking the CpG site to 

15 a Smal site is introduced. This change should not pose any 
problems, as the intrinsic sequence specificity of M.SssI is 
limited to the CpG dinucleotide (Renbaum and Razin, 1993). 
As was previously done with the LexA construct, cyclic in 
vivo/in vitro selection is used to obtain mutant chimeras in 

2 0 which the DNA methyltransf erase moiety depends on the finger 

protein part of the energy of DNA binding. This obviates 
methylation of collateral CpG sites, which is known to be- 
toxic to mammalian cells. As with the LexA construct of 
Example 1, it may be confirmed that Smal resistance is due 
25 to methylation of the target site, and not to loss of the 
site by mutation, by testing for resistance to Smal and 
sensitivity to the methylation- insensitive isoschizomer 
Xmal. Constructs that pass this test are examined for non- 
specific methylation by testing for sensitivity to the 

3 0 methylation- sensitive endonuclease BstUl (CGCG) , Hpall 

(CCGG) , Maell (ACGT) , Hhal (GCGC) . Almost 100 CpG sites 
within the plasmid construct may be tested through use of 
these enzymes. Resistance and sensitivity is assessed from 
banding patterns after endonuclease treatment and agarose 
35 gel electrophoresis. As in Example 1, it may be concluded 
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that targeted methylation has been achieved upon recovery of 
plasmids that are resistant to cleavage by Smal and 
sensitive to Xmal, BstUl , Hhal, Hpall, and Maell. 

5 Alternatively, as with the LexA construct, constructs in 
which the order of the zinc finger and M.SssI moieties has 
been reversed are constructed and subjected to the selection 
protocol described above. As in the case of the pLS 
constructs described in Example 1, the data indicate that it 
10 is possible to screen between 10 9 and 10 10 independent clones 
at once, and each iteration of the selection protocol 
requires only a few days. 

Example 4. Inhibition of HIV-1 5' LTR-dependent 
15 transcription in cultured human cells that express zinc 
finger/DNA methyl transferase fusion proteins of novel and 
predetermined specificity. Example 3 yields constructs that 
methylate critical sites in the HIV-1 5' LTR. These 
constructs are introduced into cultured human cells that 
20 express a reporter gene from the HIV-1 5' LTR as a means of 
testing the inhibitory effects of targeted de novo methy- 
lation. 

The plasmid pUC-BENN-CAT (Gendelman et al . , 1986) is 
25 introduced into HL2/3 cells (Ciminale, 1990) together with 
unlinked pSV2Neo, and stable transf ectants selected with 
G418. The HL2/3 cell line expresses most of the HIV-1 
accessory proteins (including Tat) from a replication- 
defective HIV-1 provirus (Ciminale et al., 1990). The 
JO presence of HIV-1 viral accessory proteins renders this 
assay a good simulation of HIV-1 transcription in infected 
cells. Transf ectant clones that express the CAT gene from 
the HIV-1 LTR are identified by RNA blot hybridization. 
Such cell clones are referred to as HL2/3 -LTR-CAT . 



35 



WO 97/11972 



PCT/US96/15576 



-48- 

HL2/3 -LTR-CAT cells are transfected by the calcium phosphate 
technique with a mammalian expression vector that directs 
production of the zinc f inger-M. SssI fusion protein (Example 
3) from the cytomegalovirus (CMV) immediate early enhancer- 
5 promoter. The starting vector is pEVRFO (Matthias et al . , 
1990) , which contains the CMV promoter, a beta globin intron 
in the 3' untranslated region for enhanced transcript 
stability, and a polylinker region for convenient cloning. 
We have used this vector with good success in many 

10 constructs (Czank et al., 1991; Leonhardt et al., 1992). 
This expression construct is termed pCMV-ZMet. A pSV2Hyg 
resistance marker is co- transfected with pCMV-ZMet, and the 
cells subjected to double selection with G418 and 
Hygromycin. Production of zinc f inger/M. SssI fusion protein 

15 are assayed by immunoblot analysis; as with the LexA-M.SssI 
fusion protein of Figure 3, an N-terminal c-MYC epitope tag 
is present, and the protein is visualized with the 9E10 
antibody. 

2 0 Immunofluorescence microscopy is used to determine whether 

the methylating agent can gain access to nuclei. If it is 
found to be confined to the cytoplasm, the nuclear 
localization signal of SV40 large T antigen may be added to 
the N- terminus, as we have done for a class of fusion pro- 
25 teins to induce their translocation into nuclei (Leonhardt 
et al. , 1992) . 

Cells that express zinc f inger/M . SssI fusion protein are 
examined for loss of CAT expression by CAT enzyme assay and 

3 0 by RNA blot hybridization. In the event that the most 

critical CpG site happens to lie within one of the four 
sites for methylation-sensitive restriction endonuclease 
(four of the eleven sites can be assayed in this way; see 
Figure 1), methylation may be assayed by cleavage of DNA 
3 5 with that enzyme followed by DNA blot hybridization. This 
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is a simple and long-established technique for determination 
of the methylation status of individual CpG sites. If the 
critical site chosen for targeted methylation does not lie 
within such a restriction site, the genomic sequencing 
5 method of Grigg and Clark. (1994) may be used to identify 
methylated cytosines. The method of Collins and Myers 
(1987) can also be used to detect methylation differences. 
This method exploits the increased melting temperature of 
methylated DNA, which allows methylated and unmethylated 
10 fragments to be resolved by denaturing gradient gel 
electrophoresis (Collins and Myers, 1987) . Both methods are 
well-established and both will allow unambiguous 
localization of methylated sites in HIV-1 LTR sequences. 

15 The embodiments described herein of this invention indicate 
that expression of a finger protein/M. SssI chimera targeted 
to a critical CpG site in the HIV-1 LTR methylates and 
inactivates expression of a reporter gene that is driven by 
the LTR. 

20 

Example 5. Inhibition of HIV-1 replication in T lymphocytes 
productively infected with HIV-1. Jurkat E6-1 cells are 
electroporated with infectious molecular clones of HIV-1 
strain NL4-3, and a productive infection established. The 

25 HIV-producing cells are transfected with the finger 
protein/M. SssI construct described in the previous section, 
together with pSV2Neo as a selectable marker. A vector con- 
trol and a finger protein-only control may be used. This 
latter control is necessary to distinguish inhibition via 

30 methylation from that resulting from direct binding of the 
finger protein to its target site, as has happened in one 
other case (Choo et al . , 1995). The finger protein/M . SssI 
chimera is designed to cause de novo methylation and 
transcriptional repression of HIV-1 proviral DNA. 

3 5 Methylation is assayed as described in the previous section, 
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and viral transcription is assayed by RNA blot hy- 
bridization . 

The plasmid-mediated system test described above is 
5 convenient and reliable but does not simulate conditions 
that may be encountered in a therapeutic application. A 
vector that addresses the needs of therapeutic applications 
is therefore used and described below. One of the most 
promising viral vectors is based on adeno-associated -virus 
10 (AAV) , and the methylating agent may be delivered through 
use of recombinant AAV. 



AAV has several advantages as a transducing vector: I. AAV 
is not associated with any disease, ii. The viral DNA is 

15 thought to integrate at a unique site on chromosome 19q, 
with little risk of insertional mutagenesis, iii. it has 
been reported that AAV can infect non-replicating cells 
(reviewed in Berns and Linden, 1995) , iv. AAV can infect 
lymphoid cells (Mendelson et al., 1992), and transcription 

20 units borne by recombinant AAV strains are expressed in T 
lymphocytes (Chatterjee et al . , 1992), and v. Virulent 
strains of AAV cannot arise by recombination within infected - 
cells, as can occur with retroviral vectors. 

25 Introduction of methylating agents into transducing AAV 
vectors is straightforward. The finger protein/DNA 
methyltransf erase coding region under control of the CMV 
immediate early enhancer-promoter is amplified by PCR with 
primers that introduce Xbal sites at both ends of the 

30 construct. This expression cassette is ligated between the 
Xbal sites of pAVl , an infectious molecular clone of AAV 
(Banerjee et al., 1992). The resulting clones are selected 
and propagated in E. coli, and confirmed by direct 
sequencing; clones of the desired specificity are termed 

35 pAV-ZMet. Control clones that express only the zinc finger 
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protein are constructed and termed pAV-Zif; clones that 
express only the mutant M.SssI are constructed and termed 
pAV-Met. Infectious recombinant virus stocks are obtained 
by co-transfection of adenovirus -infected HeLa cells with 

5 pAV-ZMet, pAV-Zif, or pAV-Met, together with pTAAV, a 
plasmid that encodes proteins required for replication and 
packaging (Banerjee et al., 1992). The titer of virus 
stocks obtained in this manner are commonly 10 7 mi 1 
(Banerjee et aL, 1992).;. such viruses are also capable of 

0 infecting CD4* T lymphocytes such as Jurkat cell with high 
efficiency (Mendelson et al . , 1992). The recombinant 
viruses are termed AAV-ZMet , AAV-Zif, and AAV-Met, in 
keeping with the terminology of their plasmid precursors. 

5 Jurkat cells productively infected with HIV-1 strain NL4-3 
(as described above) are infected with AAV-ZMet , AAV-Zif, or 
AAV-Met at an MOI that infects >90% of the cell population, 
as established by immunofluorescence microscopy with 
antibodies against the c-MYC epitope encoded by the fusion 
proteins. Jurkat cells are highly susceptible to infection 
by AAV (Mendelson et al . , 1992) RNA blot hybridization is 
used to evaluate- the effect of expression on transcript 
accumulation, as has been done in antisense inhibition 
experiments that utilized AAV expression vectors {Banerjee 
et al., 1992). The methylation status of integrated HIV-1 
proviral DNA may be evaluated as described in Example 3 . 
Comparison of transcript levels, cell survival, and proviral 
methylation among cultures infected with vector control, 
AAV-ZMet, AAV-Zif, and AAV-Met can give a clear indication 
of the efficacy of targeted methylation in suppressing HIV-1 
expression under conditions that approximate those to be 
encountered in actual therapeutic applications. 

Example 6- Design, selection, and affinity maturation of 
zinc finger-DNA methyltransf erase chimeras that methylate 
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critical CpG sites in a promoter of the hepatitis B virus. 

Chronic hepatitis due to infection with Hepatitis B virus 
(HBV) is a major cause of cirrhosis of the liver and 
5 hepatocellular carcinoma (reviewed by Ganem and Varmus, 
1987). Approximately 1.5% of the population is infected 
with HBV (reviewed by Saracco and Rezzetto, 1995) and the 
worldwide morbidity and mortality associated with chronic 
HBV infection is far greater than that due to HIV-1 

10 infection. As shown in Figure 10, the HBV genome is rich in 
CpG sites (there are 103 in a genome of only 3.2 kb; Ono et 
al., 1983) and transcription from HBV promoters is sensitive 
to CpG methylation (Miller and Robinson, 1983; Pourcel et 
al., 1990; The contents of these references are incorporated 

15 in their entirety into the subject application) . HBV 
proviral DNA is therefore susceptible to inactivation by 
targeted methylation according to the methods elaborated for 
HIV-1 inactivation herein. 

20 Identification of critical CpG sites, design of chimeric 
zinc finger protein/DNA methyltransf erases , and cyclic in 
vivo/in vitro selection of chimeras that methylate only the 
target CpG site is as described for the HIV-1 targets in the 
above examples. 

25 

One embodiment of the subject invention is to target the 
known promoters of the hepatitis B virus and inactivate the 
viral machinery via methylation. The site for targeted 
methylation in the hepatitis B promoter is selected as 

30 follows. First, the site will have the ability to repress 
transcription when methylated; this is established in the 
experiments of Example 2 above. Second, the binding site 
for the zinc finger moiety will be adjacent to (rather than 
within) the recognition sequence of a regulatory factor. 

3 5 The factors that bind to the promoter of the viral X protein 
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or the pre-S2/5 promoter are host-encoded and are involved 
in the transcription of many cellular genes; methylation of 
such sites is expected to cause unacceptable toxicity 
through inactivation of vital cellular genes. Third, the 
5 recognition sequence is optimally from 6 to 25 base pairs in 
length and more preferably at least 9 base pairs in length. 

Zinc finger proteins that meet the above criteria are 
selected from combinatorial expression libraries by phage 

10 display, and fused to the CpG-specific DNA methyltransf erase 
M.SssI as described in Example 1, and depicted in Figure 9. 
Rounds of cyclic in vivo/in vitro selection (also described 
in Example 1) are used to obtain variants that methylate 
only CpG sites adjacent to the binding site of the zinc 

15 finger moiety. As before, more than one iteration of the 
mutagenesis/selection procedure may be required, but the 
process has no difficult or time-consuming steps. Only a 
few days are required for each iteration, and large numbers 
of clones (>10 9 per cycle) can be subjected to selection at 

20 once. 

The -method for the selection of tridactyl zinc finger" 
proteins that bind to predetermined sequences is shown in 
Figure 9 . 

25 

The method depicted in Figure 9 is used to select a 
tridactyl finger protein that binds to a 9 bp sequence adja- 
cent to a critical CpG site in either the promoter of viral 
protein X or the pre-S2/5 promoter of hepatitis B. The 
30 spacing between the recognition sequence and the target CpG 
site may be adjusted to conform to the sum of the estimated 
radii of the two globular proteins of masses 30,000 (M.SssI) 
and 20,000 (the tridactyl zinc finger protein). This 
spacing is equivalent to roughly 25 base pairs. 
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Figure 11 A and Figure 11B illustrate targeted methylation 
of a preselected CpG site in a pLS vector which encodes a 
lexA-M.Sssl fusion protein. This data proves that a CpG- 
specific DNA methyltransf erase can be directed to a target 
5 site that is adjacent to the binding site of a sequence- 
specific DNA binding protein. This targeted methylation was 
achieved in living bacterial cells. Mutagenesis and cyclic 
in vivo/in vitro selection is being applied so as to obtain 
a chimeric protein that will methylate the target site but" 
10 have no effect on collateral CpG sites. The data shown 
above and specifically in Figures 11A-11B, demonstrate that 
the principle of targeted methylation has been reduced to 
practice in the laboratory. 

15 Example 7: Targeted Methylation in the Repression of HIV-1 
Replication 

A. Construction of chimeric DNA methyltransf erase of novel 
and predetermined specificity. 

20 

The Lac repressor was fused to the N- terminus of M.SssI by 
recombinant DNA ' technology and the chimeric protein was 
expressed in E . coli. The construct used in these 
experiments is shown in detail in Figure 12. The construct 

25 also contained a synthetic sequence that included the Lac 
repressor binding site and a cluster of CpG dinucleotides . 
This construct was used to establish three important points: 
First, that chimeric DNA methyltransf erase could be produced 
in bacterial cells in a stable and soluble form; second, 

30 that the chimeric DNA methyltransf erase would selectively 
methylate CpG sites in the vicinity of the recognition 
sequence of the lac repressor, and third, the optimal 
spacing between recognition sequence and target CpG site is 
20-30 nucleotides. 

35 
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The construct shown in Figure 12 was electroporated into E. 
coli and synthesis of fusion protein induced by addition of 
isopropylthiogalactoside to culture medium. After 3 hours 
of incubation cells were lysed, and protein extracts were 
5 subjected to SDS gel electrophoresis and immunoblot analysis 
with an anti-Lac repressor antibody. As shown in Figure 13, 
the chimeric DNA methyltransf erase was soluble and resistant 
to proteolysis as shown by its appearance as a single band 
of the appropriate size. There was little sign of a 
10 reduction in growth rate upon induction of synthesis of the 
chimeric DNA methyltransf erase . This confirmed that the 
chimeric DNA methyltransf erase was not toxic to the host 
cells . 

15 Plasmid DNA was purified from the same cells and high 
resolution mapping of methylated sites was carried out by 
the bisulfate genomic sequencing method. Bisulfite converts 
cytosine to uracil by oxidative deamination. 5- 
methylcytosine is resistant to bisulfite attack. Upon PCR 

20 amplification all cytosine will be copied as thymidines but 
5-methylcytosine will be preserved as cytosines. As shown 
in Figure 14, plasmid DNA exposed in vivo to Lac repressor- 
M.SssI fusion proteins underwent methylation at CpG sites 
near the Lac repressor binding site; vector DNA was not 

25 methylated, and plasmid exposed to M.SssI became methylated 
at all CpG sites. These data confirm that fusing a DNA 
methyltransferase to a sequence-specific DNA binding protein 
can limit methylation to CpG sites in the vicinity of the 
recognition sequence of the DNA binding protein as 

30 originally proposed. 

Figure 14 also shows that targeted methylation occurred 20- 
30 nucleotides away from the 3' border of the Lac repressor 
binding site. This is taken to reflect steric interference 
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between protein moieties. The result was expected and is 
not a limitation of the method. 

The Lac repressor-M.SssI fusion proteins confirmed the 
5 principle of targeted methylation. Figure 15 depicts a new 
class of chimeric DNA methyltransf erases that have been 
constructed and confirmed by DNA sequencing. The sequence- 
specific DNA binding moiety is a three-finger zinc finger 
protein (Zif268) of known binding specificity. These agents 
10 are being subjected to the same characterization as the Lac 
repressor-M. SssI fusions. They have also been cloned into 
mammalian expression vectors and experiments in cultured 
primate cells are being performed. 

15 In summary, chimeric DNA methyltransf erase -DNA binding 
proteins have been shown to selectively methylated CpG sites 
in the vicinity of the recognition sequence of the DNA 
binding protein moiety. 

20 B. Inactivation of HIV-1 by CpG methylation. 

As shown in Figure 16, the LTR of HIV-1 has been cloned 
upstream * of two reporter genes, chloramphenicol 
acetyltransferase (CAT) and firefly luciferase (Luc) . Both 
constructs drive high-level expression of the CAT of Luc 
25 . reporter genes when transfected into tat-expressing HL2/3 or 
HLTAT human cervical carcinoma cells. 

Methylation is known to suppress transcription from most 
promoters but its effect on HIV-1 LTR-driven transcription 

30 has not been documented in a system that resembles the 
infected cells. Figure 17 shows that CpG methylation of 9 
CpG sites in the HIV-1 LTR reduces transcription to near 
background levels; the effect is at least 20 fold. The 
effect appears to be irreversible and does not require 

35 continued application or expression of any foreign agents. 
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CpG methylation can therefore be regarded as the most 
powerful way to suppress the activity of HIV-1. Work is 
underway that will identify the CpG sites that mediate the 
biological effects of methylation. 

5 

Work as described herein, has shown that M.SssI does not 
methylate single stranded DNA . This has led to the 
development of a new method for the region-specific 
methylation of a DNA fragment. A single-stranded 

10 oligonucleotide or longer fragment synthesized by asymmetric 
PCR is annealed to a single-stranded circular DNA and 
treated with M.SssI. CpG sites in the double stranded 
region of the plasmid are methylated while all single 
stranded regions are unmodified. This new method reduces 

15 the time required to identify critical CpG sites. The 
regional methylation method is being used to evaluate the 
effects of CpG methylation on the HIV-1 LTR versus the gag 
leader sequence; this latter sequence is rich and has 
strongly conserved CpG sites {Figure 16) and methylation of 

20 the sites may suppress transcription of HIV-1 as well or 
better than methylation of LTR sequences. 
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SEQUENCE LISTING 



(1) GENERAL INFORMATION : 

(I) APPLICANT: Bestor, Timothy H. 



(ii) TITLE OF INVENTION: CHIMERIC DNA-BINDING/DNA METHYLTRANSFERASE 

NUCLEIC ACID AND POLYPEPTIDE AND USES 
THEREOF 



(iii) NUMBER OF SEQUENCES: 2 

(iv) CORRESPONDENCE ADDRESS: 

(A) ADDRESSEE: Cooper & Dunham LLP 

(B) STREET: 1185 Avenue of the Americas 

(C) CITY: New York 
(D> STATE: New York 

(E) COUNTRY: U.S.A. 

(F) ZIP: 10036 



(V) COMPUTER READABLE FORM: 

(A) MEDIUM TYPE: Floppy disk 

(B) COMPUTER: IBM PC compatible 

(C) OPERATING SYSTEM: PC- DOS/MS-DOS 

(D) SOFTWARE: Patentln Release #1.0, Version #1.30 



(Vi) CURRENT APPLICATION DATA: 

(A) APPLICATION NUMBER: 

(B) FILING DATE: 31-JAN-1996 

(C) CLASSIFICATION: 



(viii) ATTORNEY/ AGENT INFORMATION: 

(A) NAME: White, John P. 

(B) REGISTRATION NUMBER: 28,678 

(C) REFERENCE /DOCKET NUMBER: 48075-A 



(ix) TELECOMMUNICATION INFORMATION: 
(A) TELEPHONE: 212-278-0400 
{B) TELEFAX: 212-391-0526 



(2) INFORMATION FOR SEQ ID NO:l: 

(I) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 632 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE : DNA (genomic) 



<xi) 



SEQUENCE DESCRIPTION: SEQ ID NO:l: 
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TGGAAGGGCT AATTCACTCC CAACGAAGAC AAGATATCCT TGATCTGTGG ATCTACCACA 60 

CACAAGGCTA CTTCCCTGAT TAGCAGAACT ACACACCAGG GCCAGGAGTC AGATATCCAC 120 

TGACCTTTGG ATGGTGCTAC AAGCTAGTAC CAGTTGAGCC AGATAAGGTA GAAGAGGCCA 18 0 

ACAAAGGAGA GAACACCAGC TTGTTACACC CTGTGAGCCT GCATGGGATG GATGACCCGG 240 

ATATATAAGT GTTAGAGTGG AGGTTTGACA GCCGCCTAGC ATTTCATCAC GTGGCCCGAG 300 

AGCTGCATCC GGAGTACTTC AAGAACTGCT GAT AT C GAG C TTGCTACAAG GGACTTTCCG 360 

CTGGGGACTT TCCAGGGAGG CGTGGCCTGG GCGGGACTGG GGAGTGGCGA GCCCTCAGAC 420 

CTGCATATAA GCAGCTGCTT TTTGC CTGT A CTGGGTCTCT CTGGTTAGAC CAGATCTGAG 480 

CCTGGGAGCT CTCTGGCTAG CTAGGGAACC C ACT GCTT AA GCCTCAATAA AGCTTGCCTT 54 0 

GAGTGCTTCA AGTAGTGTGT GCCCGTCTGT TGTGTGACTC TGGTAACTAG AGATCCCTCA 600 

GACCCTTTTA GTCAGTGTGG AAAATCTCTA GC 632 
(2) INFORMATION FOR SEQ ID NO: 2: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 40 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS : single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: DNA (genomic) 



(Xi) SEQUENCE DESCRIPTION :. SEQ . ID NO : 2 : 
CCCCCCTGTG AGCCTGCGGG TTTTTTCCCG CAGGCTCACA 



40 
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What is claimed is: 

1 . A chimeric protein which comprises a mutated DNA 
methyltransf erase portion and a DNA binding protein 
portion that binds sufficiently close to a promoter 
sequence of a target gene, which promoter sequence 
contains a methylation site, to specifically methylate 
the site and inhibit activity of the promoter and thus 
inhibit expression of the target gene. 

2. The protein of claim 1, where in the promoter sequence 
of the target gene is a 5' long terminal repeat sequence 
of a human immunodeficiency virus -1 proviral DNA. 

3. The protein of claim 1, wherein the target gene 
comprises a retroviral gene, an adenoviral gene, a foamy 
viral gene, a parvoviral gene, a foreign gene expressed 
in a cell, an overexpressed gene, or a misexpressed 
gene . 

4. The protein of claim 1, wherein the chimeric protein 
comprises a zinc three-finger DNA binding polypeptide 
linked to a CpG-specific DNA methyltransf erase 
polypeptide . 

5. The protein of claim 1, wherein the chimeric protein 
comprises a mutated Lex A binding polypeptide linked to 
a cytosine methyltransf erase polypeptide. 

6. The method of claim 1, wherein the mutated DNA 
methyltransferase portion comprises at least a portion 
of a mutated M.SssI DNA methyltransferase protein or at 
least a portion of a mutated mammalian DNA 
methyltransferase protein. 
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7. An expression vector which encodes the chimeric protein 
of claim 1 . 

8. The vector of claim 7, wherein the expression vector is 
replicable . 

9. The vector of claim 7, wherein the vector is a pLS 
vector . 

10. The vector of claim 7, wherein the vector is a 
prokaryotic expression vector, a yeast expression 
vector, a baculovirus expression vector, a 
mammalian expression vector, or an episomal 
mammalian expression vector. 

11. A method for inhibiting expression of a target gene 
which comprises contacting a promoter of the target 
gene with the chimeric protein of claim 1 so as to 
specifically methylate the promoter thus inhibiting 
expression of the target gene. 

12. The method of claim 11, wherein the target gene is 
an endogenous target gene . 

13. The method of claim 11, wherein the target gene is 
a foreign target gene. 

14. The method of claim 13, wherein the foreign target 
gene is a retroviral gene or a viral gene. 

15. The method of claim 11, wherein the target gene is 
associated with a cancer, a central nervous system 
disorder, a blood disorder, a metabolic disorder, 
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a cardiovascular disorder, an autoimmune disorder, 
or an inflammatory disorder. 

16. The method of claim 15, wherein the cancer is acute 

lymphocytic leukemia, acute myelogenous leukemia, 
B-cell lymphoma, lung cancer, breast cancer, 
ovarian cancer, prostate cancer, lymphoma, 
Hodgkin's disease, malignant melanoma, 
neuroblastoma, renal cell carcinoma or squamous 
cell carcinoma. 



17. The method of claim 15, wherein the central nervous 
system disorder is Alzheimer's disease, Down's 
syndrome, Parkinson's disease, Huntington's 
disease, schizophrenia, or multiple sclerosis. 

18. The method of claim 15, wherein the infectious 
disease is cytomegalovirus, herpes simplex virus, 
human immunodeficiency virus, AIDS, papillomavirus, 
influenza, Candida albicans, mycobacteria, septic 
shock, or associated with a gram negative bacteria. 

19. The method of claim 15, wherein the blood disorder 
is anemia, hemoglobinopathies, sickle cell anemia, 
or hemophilia. 

20. The method of claim 15, wherein the cardiovascular 
disorder is familial hypercholesterolemia, 
atherosclerosis, or renin/angiotensin control 
disorder. 

21. The method of claim 15, wherein the metabolic 
disorder is ADA, deficient SCID, diabetes, cystic 
fibrosis, Gaucher' s disease, galactosemia, growth 
hormone deficiency, inherited emphysema, Lesch- 



PCT/US96/15576 



-72- 

Nyhan disease, liver failure, muscular dystrophy, 
phenylketonuria, or Tay-Sachs disease. 

The method of claim 15, wherein the autoimmune 
disorder is arthritis, psoriasis, HIV, or atopic 
dermatitis , 

The method of claim 15, wherein the inflammatory 
disorder is acute pancreatitis , irritable bowel 
syndrome, Chrone's disease or an allergic disorder. 

The method of claim 11, wherein the target gene is 
in a cell. 

The method of claim 24, wherein the cell is a 
eukaryotic cell, a bacterial cell, an animal cell, 
a plant cell, a prokaryotic cell, a virus packaging 
cell, a somatic cell, a germ cell, a neuronal cell, 
a myocyte, a T lymphocyte, a CD4 + cell, a tumor 
cell, a CD4+ cell, or a stem cell. 

The method of claim 11, wherein the contacting is 
by means of liposome mediated delivery, retroviral 
delivery, gene bombardment, electroporat ion or 
cationic precipitation. 

A method for inhibiting expression of a target gene 
in a multicellular organism which comprises 
contacting a promoter sequence of the target gene 
with the chimeric protein of claim 1, so as to 
specifically methylate the promoter sequence and 
thus inhibit expression of the target gene in the 
multicellular organism. 
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The method of claim 27, wherein the multicellular 
organism is a plant, an animal or a human. 

The method of claim 28, wherein the plant is an 
alfalfa plant, a broccoli plant, a rapeseed plant, 
a carrot plant, a chicory plant, a coffee plant, a 
cucurbita plant, a euromelon plant, a potato plant, 
a raspberry plant, a sunflower plant, a tomato 
plant, or a wheat plant. 

The method of claim 28, wherein the animal is a 
horse, a primate, a porcine animal, a bovine 
animal, a swine, a fowl, or a fish. 

The method of claim 27, wherein the chimeric 
protein or a nucleic acid encoding the chimeric 
protein is delivered to the multicellular organism 
via intralesional , intraperitoneal, intramuscular 
or intravenous injection; liposome-mediated 
delivery; viral infection; gene bombardment; 
topical, nasal, oral, anal, ocular or otic 
delivery. 

The method of claim 31, wherein the viral infection 
is via a non- integrating, replication-defective 
virus . 

The method of claim 32, wherein the virus comprises 
a replication-defective Human Immunodeficiency Type 
1 provirus, a retroviral vector, an adeno- 
associated virus, a LNL6 vector, a LXSN vector or 
a MMuLV retroviral vector. 

A method of treating a subject infected with a 
virus which comprises administering to the subject 
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a therapeutic composition comprising the chimeric 
protein of claim 1, or a nucleic acid molecule 
encoding the chimeric protein of claim 1, under 
suitable conditions so as to specifically methylate 
the viral promoter sequence and inhibit expression 
of the viral gene thus treating the subject 
infected with the virus. 

The method of claim 34, wherein the virus is chosen 
from the group consisting of a DNA virus, a 
retrovirus, a herpes virus, an immunodeficiency 
virus, an adeno-associated virus and an adenovirus. 

The method of claim 34, wherein the therapeutic 
composition comprises a nucleic acid molecule 
encoding a mutated Lex A DNA binding protein 
portion linked to a mutated DNA methyl transferase 
protein portion. 

The method of claim 34, wherein the therapeutic 
composition comprises a nucleic acid molecule 
encoding a tridactyl zinc finger DNA binding 
protein portion capable of specifically binding the 
Human Immunodeficiency Virus Type 1 5 'long terminal 
repeat nucleic acid sequence linked to a mutated 
DNA methyltransf erase protein portion. 

The method of claim 34, wherein the subject is a 
human . 

The method of claim 34, wherein the therapeutic 
composition comprises a replicable expression 
vector chosen from the group consisting of a pLS 
vector, a prokaryotic expression vector, a yeast 
expression vector, a baculovirus expression vector, 
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a mammalian expression vector, and an episomal 
mammalian expression vector. 

40. The method of claim 34, wherein the administration 
comprises int rales ional , intraperitoneal , 
intramuscular or intravenous injection; liposome- 
mediated delivery; viral infection; gene 
bombardment; topical, nasal, oral, anal, ocular or 
otic delivery. 

41. The method of claim 40, wherein the viral infection 
is via a non- integrating, replication-defective 
virus . 

42. A host cell comprising the expression vector of 
claim 7 . 

43. The host cell of claim 42, wherein the host cell is 
chosen from the group consisting of a eukaryotic 
cell, a somatic cell, a germ cell, a neuronal cell, 
a myocyte, a T lymphocyte, a prokaryotic cell, a 
virus packaging cell., a plant cell, a prokaryotic 
cell, a tumor cell, a stem cell and a CD4+ cell. 

44 . A pharmaceutical composition comprising a 
therapeutically effective amount of the expression 
vector of claim 7 and a pharmaceutical^ acceptable 
carrier. 

45. The pharmaceutical composition of claim 44, wherein 
the carrier comprises a diluent. 

46. The pharmaceutical composition of claim 44, wherein 
the pharmaceutical^ acceptable carrier is an 
aerosol, intravenous, oral or topical carrier. 
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47. A transgenic non-human mammal whose somatic and 

germ cells contain and express a DNA coding for a 
chimeric protein of claim 1, the DNA having been 
stably introduced into the non-human mammal at the 
single cell stage or an embryonic stage, and 
wherein the DNA is linked to a promoter and 
integrated into the genome of the non-human mammal. 
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FIGURE 5 
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FIGURE 6 



pLS, An Expression Construct to be Used in the Selection of 
DNA Binding Protein/DNA Methyltransferase Chimeras 
That Methylate Predetermined Sequences 
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FIGURE 9 

Phage-display selection of Zinc-Finger Proteins that Bind to 
Predetermined Sequences in the HIV-1 5' LTR 
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FIGURE 118 

Endonuclease: ^° ^> cf ^ 
Methylating agent: f- j — + 1 _ + | j_ 1 _ +1 
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FIGURE 13 
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FIGURE 14 
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FIGURE 16 
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